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sequence with other known nucleotide sequences using any of a number of well known sequence alignment 
programs and determining which PRO polypeptide -encoding nucleotide sequence fragmem(s) are novel. All of 
such PRO polypeptide -encoding nucleotide sequences are contemplated herein. Also contemplated are the PRO 
polypeptide fragments encoded by these nucleotide molecule fragments, preferably those PRO polypeptide 
fragments that comprise a binding site for an anti-PRO antibody. 
5 In another embodiment, the invention provides isolated PRO polypeptide encoded by any of the isolated 

nucleic acid sequences hereinabove identified. 

In a certain aspect, the invention concerns an isolated PRO polypeptide, comprising an amino acid 
sequence having at least about 80% sequence identity, preferably at least about 81% sequence identity, more 
preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yei 

10 more preferably at least about 84% sequence identity, yet more preferably at least about 85% sequence identity, 
yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% sequence 
identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 89% 
sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 
91 % sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least 

15 about 93% sequence identity, yet more preferably at least about 94% sequence identity, yet more preferably at 
least about 95% sequence identity, yet more preferably at least about 96% sequence identity, yet more preferably 
at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more 
preferably at least about 99% sequence identity to a PRO polypeptide having a full-length amino acid sequence 
as disclosed herein, a full-length amino acid sequence lacking the signal peptide as disclosed herein or an 

20 extracellular domain of a transmembrane protein as disclosed herein. 

In a further aspect, the invention concerns an isolated PRO polypeptide comprising an amino acid 
sequence having at least about 80% sequence identity, preferably at least about 81% sequence identity, more 
preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet 
more preferably at least about 84% sequence identity, yet more preferably at least about 85 % sequence identity, 

25 yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% sequence 
identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 89% 
sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 
91 % sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least 
about 93% sequence identity, yet more preferably at least about 94% sequence identity, yet more preferably at 

30 least about 95 % sequence identity, yet more preferably at least about 96 % sequence identity, yet more preferably 
at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more 
preferably at least about 99% sequence identity to an amino acid sequence encoded by any of the human protein 
cDNAs deposited with the ATCC as disclosed herein. 

In a further aspect, the invention concerns an isolated PRO polypeptide comprising an amino acid 

35 sequence scoring at least about 80% positives, preferably at least about 81 % positives, more preferably at least 
about 82% positives, yet more preferably at least about 83% positives, yet more preferably at least about 84% 
positives, yet more preferably at least about 85% positives, yet more preferably at least about 86% positives. 
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yer more preferably ai least about 87% positives, yet more preferably at least about 88% positives, yet more 
preferably at least about 89% positives, yet more preferably at least about 90% positives, yet more preferably 
at least about 91 % positives, yei more preferably at least about 92% positives, yet more preferably at least about 
93% positives, yet more preferably at least about 94% positives, yet more preferably at least about 95% 
positives, yet more preferably at least about 96% positives, yet more preferably at least about 97% positives, 
5 yet more preferably at least about 98 % positives and yet more preferably at least about 99 % positives when 
compared with the amino acid sequence of a PRO polypeptide having a full-length amino acid sequence as 
disclosed herein, a full-length amino acid sequence lacking the signal peptide as disclosed herein or an 
extracellular domain of a transmembrane protein as disclosed herein. 

In a specific aspect, the invention provides an isolated PRO polypeptide without the N-terminal signal 

10 sequence and/or the initiating methionine and is encoded by a nucleotide sequence that encodes such an amino 
acid sequence as hereinbefore described. Processes for producing the same are also herein described, wherein 
those processes comprise culturing a host cell comprising a vector which comprises the appropriate encoding 
nucleic acid molecule under conditions suitable for expression of the PRO polypeptide and recovering the PRO 
polypeptide from the cell culture. 

15 Another aspect the invention provides an isolated PRO polypeptide which is cither transmembrane 

domain-deleted or transmembrane domain-inactivated. Processes for producing the same are also herein 
described, wherein those processes comprise culturing a host cell comprising a vector which comprises the 
appropriate encoding nucleic acid molecule under conditions suitable for expression of the PRO polypeptide and 
recovering the PRO polypeptide from the cell culture. 

20 Another embodiment of the present invention is directed to the use of a PRO polypeptide, or an agonist 

or antagonist thereof as hereinbefore described, or an anti-PRO antibody, for the preparation of a medicament 
useful in the treatment of a condition which is responsive to the PRO polypeptide, an agonist or antagonist 
thereof or an anti-PRO antibody. 

25 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a nucleotide sequence (SEQ ID NO: 1) of a native sequence PR0281 (UNQ244) cDNA, 
wherein SEQ ID NO:l is a clone designated herein as "DNA 16422- 1209". 

Figure 2 shows the amino acid sequence (SEQ ID NO:2) derived from the coding sequence of SEQ ID 
NO:l shown in Figure 1. 

30 Figure 3 shows a nucleotide sequence (SEQ ID NO:5) of a native sequence PR0276 (UNQ243) cDNA, 

wherein SEQ ID NO:5 is a clone designated herein as "DNA1 6435- 1208". 

Figure 4 shows the amino acid sequence (SEQ ID NO: 6) derived from the coding sequence of SEQ ID 
NO: 5 shown in Figure 3. 

Figure 5 shows a nucleotide sequence (SEQ ID NO:7) of a native sequence PR0189 (UNQ163) cDNA, 
35 wherein SEQ ID NO:7 is a clone designated herein as "DNA2 1642- 1391". 

Figure 6 shows the amino acid sequence (SEQ ID NO:8) derived from the coding sequence of SEQ ID 
NO: 7 shown in Figure 5. 
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Figure 7 shows a nucleotide sequence designated herein as DN A 14187 (SEQ ID NO:9). 
Figure 8 shows a nucleotide sequence (SEQ ID NO: 13) of a native sequence PRO 1 90 (UNQ 164) cDNA, 
wherein SEQ ID NO: 13 is a clone designated herein as "DNA23334-1392". 

Figure 9 shows the amino acid sequence (SEQ ID NO: 14) derived from the coding sequence of SEQ 
ID NO: 13 shown in Figure 8. 
5 Figure 10 shows a nucleotide sequence designated herein as DNA 14232 (SEQ ID NO: 15). 

Figure 11 shows a nucleotide sequence (SEQ ID NO: 19) of a native sequence PR0341 (UNQ300) 
cDNA, wherein SEQ ID NO: 19 is a clone designated herein as "DNA26288-I239". 

Figure 12 shows the amino acid sequence (SEQ ID NO:20) derived from the coding sequence of SEQ 
ID NO: 19 shown in Figure 11. 
10 Figure 13 shows a nucleotide sequence designated herein as DNA12920 (SEQ ID NO:21). 

Figure 14 shows a nucleotide sequence (SEQ ID NO:22) of a native sequence PRO180 (UNQ154) 
cDNA, wherein SEQ ID NO:22 is a clone designated herein as "DNA26843-1389". 

Figure 15 shows the amino acid sequence (SEQ ID NO: 23) derived from the coding sequence of SEQ 
ID NO: 22 shown in Figure 14. 
15 Figure 16 shows a nucleotide sequence designated herein as DNA 12922 (SEQ ID NO:24). 

Figure 17 shows a nucleotide sequence (SEQ ID NO:27) of a native sequence PRO 194 (UNQ 168) 
cDNA, wherein SEQ ID NO:27 is a clone designated herein as "DNA26844-1394". 

Figure 18 shows the amino acid sequence (SEQ ID NO:28) derived from the coding sequence of SEQ 
ID NO: 27 shown in Figure 17. 
20 Figure 19 shows a nucleotide sequence (SEQ ID NO:29) of a native sequence PRO203 (UNQ177) 

cDNA, wherein SEQ ID NO:29 is a clone designated herein as "DNA30862-I396". 

Figure 20 shows the amino acid sequence (SEQ ID NO:30) derived from the coding sequence of SEQ 
ID NO:29 shown in Figure 19. 

Figure 21 shows a nucleotide sequence designated herein as DNA 1561 8 (SEQ ID NO:31). 
25 Figure 22 shows a nucleotide sequence (SEQ ID NO:32) of a native sequence PRO290 (UNQ253) 

cDNA, wherein SEQ ID NO: 32 is a clone designated herein as "DNA35680-1212". 

Figure 23 shows the amino acid sequence (SEQ ID NO:33) derived from the coding sequence of SEQ 
ID NO:32 shown in Figure 22. 

Figure 24 shows a nucleotide sequence (SEQ ID NO:35) of a native sequence PR0874 (UNQ441) 
30 cDNA, wherein SEQ ID NO:35 is a clone designated herein as "DNA4062 1-1440". 

Figure 25 shows the amino acid sequence (SEQ ID NO: 36) derived from the coding sequence of SEQ 
ID NO:35 shown in Figure 24. 

Figure 26 shows a nucleotide sequence (SEQ ID NO:40) of a native sequence PRO710 (UNQ374) 
cDNA, wherein SEQ ID NO:40 is a clone designated herein as "DNA44 16 1-1434". 
35 Figure 27 shows the amino acid sequence (SEQ ID NO:41) derived from the coding sequence of SEQ 

ID NO:40 shown in Figure 26. 

Figure 28 shows a nucleotide sequence designated herein as DNA38I90 (SEQ ID NO:42). 
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Figure 29 shows a nucleotide sequence (SEQ ID NO:46) of a native sequence PR01151 (UNQ581) 
cDNA, wherein SEQ ID NO:46 is a clone designated herein as "DNA44694-1500". 

Figure 30 shows the amino acid sequence (SEQ ID NO:47) derived from the coding sequence of SEQ 
ID NO:46 shown in Figure 29. 

Figure 31 shows a nucleotide sequence (SEQ ID NO:51) of a native sequence PR01282 (UNQ652) 
5 cDNA, wherein SEQ ID NO:51 is a clone designated herein as "DNA45495-1550". 

Figure 32 shows the amino acid sequence (SEQ ID NO:52) derived from the coding sequence of SEQ 
ID NO:51 shown in Figure 31. 

Figure 33 shows a nucleotide sequence (SEQ ID NO:56) of a native sequence PR0358 cDNA, wherein 
SEQ ID NO:56 is a clone designated herein as "DNA47361-1 154. 
10 Figure 34 shows the amino acid sequence (SEQ ID NO:57) derived from the coding sequence of SEQ 

ID NO: 56 shown in Figure 33. 

Figures 35A-B show a nucleotide sequence (SEQ ID NO:61) of a native sequence PRO1310 cDNA, 
wherein SEQ ID NO:61 is a clone designated herein as "DNA47394-1572. 

Figure 36 shows the amino acid sequence (SEQ ID NO:62) derived from the coding sequence of SEQ 
15 ID NO:6l shown in Figures 35A-B. 

Figure 37 shows a nucleotide sequence (SEQ ID NO:66) of a native sequence PR0698 (UNQ362) 
cDNA, wherein SEQ ID NO: 66 is a clone designated herein as "DNA48320-1433V 

Figure 38 shows the amino acid sequence (SEQ ID NO: 67) derived from the coding sequence of SEQ 
ID NO:66 shown in Figure 37. 
20 Figure 39 shows a nucleotide sequence designated herein as DNA39906 (SEQ ID NO:68). 

Figure 40 shows a nucleotide sequence (SEQ ID NO:72) of a native sequence PR0732 (UNQ396) 
cDNA, wherein SEQ ID NO: 72 is a clone designated herein as "DNA48334-1435". 

Figure 41 shows the amino acid sequence (SEQ ID NO:73) derived from the coding sequence of SEQ 
ID NO:72 shown in Figure 40. 
25 Figure 42 shows a nucleotide sequence designated herein as DNA20239 (SEQ ID NO:74). 

Figure 43 shows a nucleotide sequence designated herein as DNA38050 (SEQ ID NO:75). 

Figure 44 shows a nucleotide sequence designated herein as DNA40683 (SEQ ID NO:76). 

Figure 45 shows a nucleotide sequence designated herein as DNA42580 (SEQ ID NO:77). 

Figures 46A-B show a nucleotide sequence (SEQ ID NO: 83) of a native sequence PRO 1 120 (UNQ559) 
30 cDNA, wherein SEQ ID NO:83 is a clone designated herein as "DNA48606-I479". 

Figure 47 shows the amino acid sequence (SEQ ID NO: 84) derived from the coding sequence of SEQ 
ID NO: 83 shown in Figures 46A-B. 

Figure 48 shows a nucleotide sequence (SEQ ID NO:94) of a native sequence PR0537 (UNQ338) 
cDNA, wherein SEQ ID NO:94 is a clone designated herein as "DNA4914 1-1431". 
35 Figure 49 shows the amino acid sequence (SEQ ID NO:95) derived from the coding sequence of SEQ 

ID NO:94 shown in Figure 48. 

Figure 50 shows a nucleotide sequence (SEQ ID NO: 96) of a native sequence PR0536 (UNQ337) 
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cDNA, wherein SEQ ID NO:96 is a clone designated herein as "DNA49 142- 1430". 

Figure 51 shows the amino acid sequence (SEQ ID NO:97) derived from the coding sequence of SEQ 
ID NO: 96 shown in Figure 50. 

Figure 52 shows a nucleotide sequence (SEQ ID NO:98) of a native sequence PR0535 (UNQ336) 
cDNA, wherein SEQ ID NO:98 is a clone designated herein as "DNA49143-1429". 
5 Figure 53 shows the amino acid sequence (SEQ ID NO:99) derived from the coding sequence of SEQ 

ID NO: 98 shown in Figure 52. 

Figure 54 shows a nucleotide sequence designated herein as DNA30861 (SEQ ID NO: 100). 
Figure 55 shows a nucleotide sequence designated herein as DNA36351 (SEQ ID NO:101). 
Figure 56 shows a nucleotide sequence (SEQ ID NO: 102) of a native sequence PR07I8 (UNQ386) 
10 cDNA, wherein SEQ ID NO: 102 is a clone designated herein as "DNA49647- 1398 V 

Figure 57 shows the amino acid sequence (SEQ ID NO: 103) derived from the coding sequence of SEQ 
ID NO: 102 shown in Figure 56. 

Figure 58 shows a nucleotide sequence designated herein as DNA 15386 (SEQ ID NO: 104). 
Figure 59 shows a nucleotide sequence designated herein as DNA16630 (SEQ ID NO: 105). 
15 Figure 60 shows a nucleotide sequence designated herein as DNA 16829 (SEQ ID NO: 106). 

Figure 61 shows a nucleotide sequence designated herein as DNA28357 (SEQ ID NO: 107). 
Figure 62 shows a nucleotide sequence designated herein as DNA43512 (SEQ ID NO: 108). 
Figure 63 shows a nucleotide sequence (SEQ ID NO: 112) of a native sequence PR0872 (UNQ439) 
cDNA, wherein SEQ ID NO: 1 12 is a clone designated herein as 'DNA498 19- 1439 V 
20 Figure 64 shows the amino acid sequence (SEQ ID NO: 1 1 3) derived from the coding sequence of SEQ 

ID NO: 1 12 shown in Figure 63. 

Figure 65 shows a nucleotide sequence (SEQ ID NO:114) of a native sequence PRO1063 (UNQ128) 
cDNA, wherein SEQ ID NO: 1 14 is a clone designated herein as "DNA49820-1427V 

Figure 66 shows the amino acid sequence (SEQ ID NO: 1 15) derived from the coding sequence of SEQ 
25 ID NO: 114 shown in Figure 65 . 

Figure 67 shows a nucleotide sequence (SEQ ID NO: 116) of a native sequence PR0619 (UNQ355) 
cDNA„ wherein SEQ ID NO: 116 is a clone designated herein as 'DNA4982 1-1 562 V 

Figure 68 shows the amino acid sequence (SEQ ID NO: 1 1 7) derived from the coding sequence of SEQ 
ID NO: 116 shown in Figure 67. 
30 Figure 69 shows a nucleotide sequence (SEQ ID NO: 1 18) of a native sequence PR0943 (UNQ480) 

cDNA, wherein SEQ ID NO:118 is a clone designated herein as "DNA52192-I369V 

Figure 70 shows the amino acid sequence (SEQ ID NO: 119) derived from the coding sequence of SEQ 
ID NO: 118 shown in Figure 69. 

Figure 71 shows a nucleotide sequence (SEQ ID NO: 123) of a native sequence PROl 188 (UNQ602) 
35 cDNA, wherein SEQ ID NO: 123 is a clone designated herein as "DNA52598-1518V 

Figure 72 shows the amino acid sequence (SEQ ID NO: 124) derived from the coding sequence of SEQ 
ID NO: 123 shown in Figure 71 . 
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Figure 73 shows a nucleotide sequence (SEQ ID NO: 128) of a native sequence PROH33 (UNQ571) 
cDNA, wherein SEQ ID NO: 128 is a clone designated herein as "DNA53913-1490". 

Figure 74 shows the amino acid sequence (SEQ ID NO: 129) derived from the coding sequence of SEQ 
ID NO: 128 shown in Figure 73. 

Figure 75 shows a nucleotide sequence (SEQ ID NO: 134) of a native sequence PR0784 (UNQ459) 
5 cDNA, wherein SEQ ID NO: 134 is a clone designated herein as "DNA53978-1443". 

Figure 76 shows the amino acid sequence (SEQ ID NO: 135) derived from the coding sequence of SEQ 
ID NO: 134 shown in Figure 75. 

Figure 77 shows a nucleotide sequence designated herein as DNA44661 (SEQ ID NO: 136). 

Figure 78 shows a nucleotide sequence (SEQ ID NO: 137) of a native sequence PR0783 (UNQ458) 
10 cDNA, wherein SEQ ID NO: 137 is a clone designated herein as "DNA53996-1442V 

Figure 79 shows the amino acid sequence (SEQ ID NO: 138) derived from the coding sequence of SEQ 
ID NO: 137 shown in Figure 78. 

Figure 80 shows a nucleotide sequence designated herein as DNA45201 (SEQ ID NO: 139). 

Figure 81 shows a nucleotide sequence designated herein as DN A 14575 (SEQ ID NO: 140). 
15 Figure 82 shows a nucleotide sequence (SEQ ID NO: 145) of a native sequence PRO820 (UNQ503) 

cDNA, wherein SEQ ID NO:145 is a clone designated herein as "DNA5604I-1416". 

Figure 83 shows the amino acid sequence (SEQ ID NO: 146) derived from the coding sequence of SEQ 
ID NO: 145 shown in Figure 82. 

Figure 84 shows a nucleotide sequence (SEQ ID NO: 147) of a native sequence PRO1080 (UNQ537) 
20 cDNA, wherein SEQ ID NO: 147 is a clone designated herein as "DNA56047-]456\ 

Figure 85 shows the amino acid sequence (SEQ ID NO: 148) derived from the coding sequence of SEQ 
ID NO: 147 shown in Figure 84. 

Figure 86 shows a nucleotide sequence designated herein as DNA36527 (SEQ ID NO: 149). 

Figure 87 shows a nucleotide sequence (SEQ ID NO: 150) of a native sequence PROI079 (UNQ536) 
25 cDNA, wherein SEQ ID NO: 150 is a clone designated herein as "DNA56050-1455". 

Figure 88 shows the amino acid sequence (SEQ ID NO: 151) derived from the coding sequence of SEQ 
ID NO: 150 shown in Figure 87. 

Figure 89 shows a nucleotide sequence (SEQ ID NO: 152) of a native sequence PR0793 (UNQ432) 
cDNA, wherein SEQ ID NO: 152 is a clone designated herein as "DNA561 10-1437". 
30 Figure 90 shows the amino acid sequence (SEQ ID NO; 153) derived from the coding sequence of SEQ 

ID NO: 152 shown in Figure 89. 

Figure 91 shows a nucleotide sequence designated herein as DNA50177 (SEQ ID NO: 154). 

Figure 92 shows a nucleotide sequence (SEQ ID NO:I55) of a native sequence PRO10I6 (UNQ499) 
cDNA, wherein SEQ ID NO:155 is a clone designated herein as "DNA561 13-1378". 
35 Figure 93 shows the amino acid sequence (SEQ ID NO: 156) derived from the coding sequence of SEQ 

ID NO: 155 shown in Figure 92. 
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Figure 94 shows a nucleotide sequence (SEQ ID NO: 157) of a native sequence PRO1013 (UNQ496) 
cDNA, wherein SEQ ID NO: 157 is a clone designated herein as "DNA56410-1414". 

Figure 95 shows the amino acid sequence (SEQ ID NO: 158) derived from the coding sequence of SEQ 
ID NO: 157 shown in Figure 94. 

Figure 96 shows a nucleotide sequence (SEQ ID NO: 159) of a native sequence PR0937 (UNQ474) 
5 cDNA, wherein SEQ ID NO: 159 is a clone designated herein as "DNA564 36-1448". 

Figure 97 shows the amino acid sequence (SEQ ID NO: 160) derived from the coding sequence of SEQ 
ID NO: 159 shown in Figure 96. 

Figure 98 shows a nucleotide sequence (SEQ ID NO: 164) of a native sequence PR0842 (UNQ473) 
cDNA, wherein SEQ ID NO: 164 is a clone designated herein as "DNA56855-I447". 
10 Figure 99 shows the amino acid sequence (SEQ ID NO: 165) derived from the coding sequence of SEQ 

ID NO: 1 64 shown in Figure 98. 

Figure 100 shows a nucleotide sequence (SEQ ID NO: 166) of a native sequence PR0839 (UNQ472) 
cDNA, wherein SEQ ID NO: 166 is a clone designated herein as "DNA56859-1445". 

Figure 101 shows the amino acid sequence (SEQ ID NO: 1 67) derived from the coding sequence of SEQ 
15 ID NO: 166 shown in Figure 100. 

Figure 102 shows a nucleotide sequence (SEQ ID NO: 168) of a native sequence PROl 180 (UNQ594) 
cDNA, wherein SEQ ID NO: 168 is a clone designated herein as "DNA5686O-1510". 

Figure 103 shows the amino acid sequence (SEQ ID NO: 169) derived from the coding sequence of SEQ 
ID NO: 168 shown in Figure 102. 
20 Figure 104 shows a nucleotide sequence (SEQ ID NO: 170) of a native sequence PROl 134 (UNQ572) 

cDNA, wherein SEQ ID NO: 170 is a clone designated herein as "DNA56865-149K . 

Figure 105 shows the amino acid sequence (SEQ ID NO: 171) derived from the coding sequence of SEQ 
ID NO: 170 shown in Figure 104. 

Figure 106 shows a nucleotide sequence designated herein as DNA52352 (SEQ ID NO: 172). 
25 Figure 107 shows a nucleotide sequence designated herein as DNA55725 (SEQ ID NO: 173). 

Figure 108 shows a nucleotide sequence (SEQ ID NO: 174) of a native sequence PRO830 (UNQ470) 
cDNA, wherein SEQ ID NO: 174 is a clone designated herein as "DNA56866-1342". 

Figure 109 shows the amino acid sequence (SEQ ID NO: 175) derived from the coding sequence of SEQ 
ID NO: 174 shown in Figure 108. 
30 Figure 1 10 shows a nucleotide sequence (SEQ ID NO: 176) of a native sequence PROl 1 15 (UNQ558) 

cDNA, wherein SEQ ID NO: 176 is a clone designated herein as "DNA56868-1478". 

Figure 1 1 1 shows the amino acid sequence (SEQ ID NO: 177) derived from the coding sequence of SEQ 
ID NO: 176 shown in Figure 1 10. 

Figure 112 shows a nucleotide sequence (SEQ ID NO: 178) of a native sequence PRO 1277 (UNQ647) 
35 cDNA, wherein SEQ ID NO:178 is a clone designated herein as "DNA56869-I545". 

Figure 1 13 shows the amino acid sequence (SEQ ID NO: 179) derived from the coding sequence of SEQ 
ID NO: 178 shown in Figure 112. 
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Figure 1 14 shows a nucleotide sequence (SEQ ID NO: 180) of a native sequence PRO 1 135 (UNQ573) 
cDNA, wherein SEQ ID NO: 180 is a clone designated herein as "DNA56870-1492*. 

Figure 1 15 shows the amino acid sequence (SEQ ID NO: 181) derived from the coding sequence of SEQ 
ID NO: 180 shown in Figure 114. 

Figure 1 16 shows a nucleotide sequence (SEQ ID NO: 182) of a native sequence PRO 11 14 (UNQ557) 
5 cDNA, wherein SEQ ID NO: 182 is a clone designated herein as "DNA57033-1403V 

Figure 1 17 shows the amino acid sequence (SEQ ID NO: 1 83) derived from the coding sequence of SEQ 
ID NO: 182 shown in Figure 116. 

Figure 118 shows a nucleotide sequence designated herein as DNA48466 (SEQ ID NO: 184). 

Figure 119 shows a nucleotide sequence (SEQ ID NO: 188) of a native sequence PR0828 (UNQ469) 
10 cDNA, wherein SEQ ID NO: 188 is a clone designated herein as "DNA57037-1444". 

Figure 120 shows the amino acid sequence (SEQ ID NO: 1 89) derived from the coding sequence of SEQ 
ID NO: 188 shown in Figure 119. 

Figure 121 shows a nucleotide sequence (SEQ ID NO: 193) of a native sequence PRO1009 (UNQ493) 
cDNA, wherein SEQ ID NO: 193 is a clone designated herein as "DNA57129-1413". 
1 5 Figure 1 22 shows the amino acid sequence (SEQ ID NO: 1 94) derived from the coding sequence of SEQ 

ID NO: 193 shown in Figure 121. 

Figure 123 shows a nucleotide sequence designated herein as DNA50853 (SEQ ID NO: 195). 

Figure 124 shows a nucleotide sequence (SEQ ID NO: 196) of a native sequence PRO 1007 (UNQ491) 
cDNA, wherein SEQ ID NO: 196 is a clone designated herein as "DNA57690-1374\ 
20 Figure 1 25 shows the amino acid sequence (SEQ ID NO: 1 97) derived from the coding sequence of SEQ 

ID NO: 196 shown in Figure 124. 

Figure 126 shows a nucleotide sequence (SEQ ID NO: 198) of a native sequence PRO1056 (UNQ521) 
cDNA, wherein SEQ ID NO: 198 is a clone designated herein as "DNA57693- 1424V 

Figure 127 shows the amino acid sequence (SEQ ID NO: 199) derived from the coding sequence of SEQ 
25 ID NO: 198 shown in Figure 126. 

Figure 128 shows a nucleotide sequence (SEQ ID NO:200) of a native sequence PR0826 (UNQ467) 
cDNA. wherein SEQ ID NO:200 is a clone designated herein as "DNA57694-1341". 

Figure 129 shows the amino acid sequence (SEQ ID NO:201) derived from the coding sequence of SEQ 
ID NO: 200 shown in Figure 128. 
30 Figure 130 shows a nucleotide sequence (SEQ ID NO: 202) of a native sequence PR0819 (UNQ466) 

cDNA, wherein SEQ ID N0:202 is a clone designated herein as "DNA57695-1340". 

Figure 131 shows the amino acid sequence (SEQ ID NO:203) derived from the coding sequence of SEQ 
ID NO:202 shown in Figure 130. 

Figure 132 shows a nucleotide sequence (SEQ ID NO:204) of a native sequence PRO 1006 (UNQ490) 
35 cDNA, wherein SEQ ID NO:204 is a clone designated herein as "*DNA57699-1412". 

Figure 133 shows the amino acid sequence (SEQ ID NO:205) derived from the coding sequence of SEQ 
ID NO:204 shown m Figure 132. 
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Figure 134 shows a nucleotide sequence (SEQ ID NO:206) of a native sequence PROl 1 12 (UNQ555) 
cDNA, wherein SEQ ID NO:206 is a clone designated herein as 'DNA57702-1476V 

Figure 135 shows the amino acid sequence (SEQ ID NO:207) derived from the coding sequence of SEQ 
ID NO:206 shown in Figure 134. 

Figure 136 shows a nucleotide sequence (SEQ ID NO:208) of a native sequence PRO1074 (UNQ531) 
5 cDNA, wherein SEQ ID NO:208 is a clone designated herein as "DNA57704-1452V 

Figure 137 shows the amino acid sequence (SEQ ID NO:209) derived from the coding sequence of SEQ 
ID NO:208 shown in Figure 136. 

Figure 138 shows a nucleotide sequence (SEQ ID NO:210) of a native sequence PRO 1005 (UNQ489) 
cDNA. wherein SEQ ID NO:210 is a clone designated herein as "DNA57708-1005", 
10 Figure 139 shows the amino acid sequence (SEQ ID NO: 21 1) derived from the coding sequence of SEQ 

ID NO:210 shown in Figure 138. 

Figure 140 shows a nucleotide sequence (SEQ ID NO :2 12) of a native sequence PRO 1073 (UNQ530) 
cDNA, wherein SEQ ID NO:212 is a clone designated herein as "DNA57710-1451". 

Figure 141 shows the amino acid sequence (SEQ ID NO:213) derived from the coding sequence of SEQ 
15 ID NO:212 shown in Figure 140. 

Figure 142 shows a nucleotide sequence designated herein as DNA55938 (SEQ ID NO:2l4). 
Figure 143 shows a nucleotide sequence (SEQ ID NO:215) of a native sequence PROU52 (UNQ582) 
cDNA, wherein SEQ ID NO:215 is a clone designated herein as "DNA5771 1-1501". 

figure 144 shows the amino acid sequence (SEQ ID NO:216) derived from the coding sequence of SEQ 
20 ID NO:215 shown in Figure 143. 

Figure 145 shows a nucleotide sequence designated herein as DNA55807 (SEQ ID NO:217). 
Figure 146 shows a nucleotide sequence (SEQ ID NO:218) of a native sequence PROI 136 (UNQ574) 
cDNA, wherein SEQ ID NO:218 is a clone designated herein as "DNA57827-1493". 

Figure 147 shows the amino acid sequence (SEQ ID NO:219) derived from the coding sequence of SEQ 
25 ID NO:218 shown in Figure 146. 

Figure 148 shows a nucleotide sequence (SEQ ID NO:220) of a native sequence PR0813 (UNQ465) 
cDNA, wherein SEQ ID NO:220 is a clone designated herein as "DNA57834-1339\ 

Figure 149 shows the amino acid sequence (SEQ ID NO:221) derived from the coding sequence of SEQ 
ID NO:220 shown in Figure 148. 
30 Figure 150 shows a nucleotide sequence (SEQ ID NO:222) of a native sequence PRO809 (UNQ464) 

cDNA, wherein SEQ ID NO:222 is a clone designated herein as "DNA57836-1338". 

Figure 151 shows the amino acid sequence (SEQ ID NO: 223) derived from the coding sequence of SEQ 
ID NO:222 shown in Figure 150. 

Figure 152 shows a nucleotide sequence (SEQ ID NO:224) of a native sequence PR0791 (UNQ463) 
35 cDNA, wherein SEQ ID NO:224 is a clone designated herein as "DNA57838-1337\ 

Figure 153 shows the amino acid sequence (SEQ ID NO:225) derived from the coding sequence of SEQ 
ID NO:224 shown in Figure 152. 
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Figure 154 shows a nucleotide sequence (SEQ ID NO:226) of a native sequence PRO 1004 (UNQ488) 
cDNA, wherein SEQ ID NO:226 is a clone designated herein as 'DNA57844-1410". 

Figure 155 shows the amino acid sequence (SEQ ID NO:227) derived from the coding sequence of SEQ 
ID NO: 226 shown in Figure 154. 

Figure 156 shows a nucleotide sequence (SEQ ID NO:228) of a native sequence PROll 1 1 (UNQ554) 
5 cDNA, wherein SEQ ID NO:228 is a clone designated herein as "DNA5872 1-1475". 

Figure 157 shows the amino acid sequence (SEQ ID NO:229) derived from the coding sequence of SEQ 
ID N 0:228 shown in Figure 156. 

Figure 158 shows a nucleotide sequence (SEQ ID NO:230) of a native sequence PR01344 (UNQ699) 
cDNA, wherein SEQ ID NO:230 is a clone designated herein as "DNA58723-1588". 
10 Figure 159 shows the amino acid sequence (SEQ ID NO: 23 1) derived from the coding sequence of SEQ 

ID NO:230 shown in Figure 158. 

Figure 160 shows a nucleotide sequence (SEQ ID NO:235) of a native sequence PRO1109 (UNQ552) 
cDNA, wherein SEQ ID NO:235 is a clone designated herein as "DNA58737-1473". 

Figure 161 shows the amino acid sequence (SEQ ID NO:236) derived from the coding sequence of SEQ 
15 ID NO:235 shown in Figure 160. 

Figure 162 shows a nucleotide sequence (SEQ ID NO:240) of a native sequence PR01383 (UNQ719) 
cDNA, wherein SEQ ID NO:240 is a clone designated herein as "DNA58743-1609". 

Figure 163 shows the amino acid sequence (SEQ ID NO: 24 1 ) derived from the coding sequence of SEQ 
ID N0:24C shown in Figure 162. 
20 Figure 164 shows a nucleotide sequence (SEQ ID NO . 245) of a native sequence PRO1003 (UNQ487) 

cDNA, wherein SEQ ID NO:245 is a clone designated herein as "DNA58846-1409". 

Figure 165 shows the amino acid sequence (SEQ ID NO: 246) derived from the coding sequence of SEQ 
ID NO: 245 shown in Figure 164. 

Figure 166 shows a nucleotide sequence (SEQ ID NO:247) of a native sequence PRO1108 (UNQ551) 
25 cDNA, wherein SEQ ID NO:247 is a clone designated herein as "DNA58848-1472" . 

Figure 167 shows the amino acid sequence (SEQ ID NO: 248) derived from the coding sequence of SEQ 
ID NO:247 shown in Figure 166. 

Figure 168 shows a nucleotide sequence (SEQ ID NO:249) of a native sequence PR01137 (UNQ575) 
cDNA, wherein SEQ ID N0:249 is a clone designated herein as "DNA58849-1494". 
30 Figure 1 69 shows the amino acid sequence (SEQ ID NO :250) derived from the coding sequence of SEQ 

ID NO:249 shown in Figure 168. 

Figure 170 shows a nucleotide sequence (SEQ ID NO:252) of a native sequence PROl 138 (UNQ576) 
cDNA, wherein SEQ ID NO: 252 is a clone designated herein as "DNA58850-1495V 

Figure 171 shows the amino acid sequence (SEQ ID NO:253) derived from the coding sequence of SEQ 
35 ID NO:252 shown in Figure 170. 

Figure 172 shows a nucleotide sequence designated herein as DNA49140 (SEQ ID NO:254). 



292 



WO 99/63088 



PCT/US99/12252 



Figure 173 shows a nucleotide sequence (SEQ ID NO:255) of a native sequence PRO1054 (UNQ519) 
cDNA, wherein SEQ ID NO:255 is a clone designated herein as "DNA58853- 1423 V 

Figure 174 shows the amino acid sequence (SEQ ID NO:256) derived from the coding sequence of SEQ 
ID NO;255 shown in Figure 173. 

Figure 175 shows a nucleotide sequence (SEQ ID NO:257) of a native sequence PR0994 (UNQ518) 
5 cDNA, wherein SEQ ID NO: 257 is a clone designated herein as "DNA58855-1422". 

Figure 176 shows the amino acid sequence (SEQ ID NO:258) derived from the coding sequence of SEQ 
ID NO:257 shown in Figure 175. 

Figure 177 shows a nucleotide sequence (SEQ ID NO: 259) of a native sequence PR0812 (UNQ5I7) 
cDNA, wherein SEQ ID NO:259 is a clone designated herein as 'DNA59205-1421". 
10 Figure 1 78 shows the amino acid sequence (SEQ ID NO:260) derived from the coding sequence of SEQ 

ID NO:259 shown in Figure 177. 

Figure 179 shows a nucleotide sequence (SEQ ID NO:261) of a native sequence PRO1069 (UNQ526) 
cDNA, wherein SEQ ID NO:261 is a clone designated herein as "DNA5921 1-1450". 

Figure 180 shows the amino acid sequence (SEQ ID NO:262) derived from the coding sequence of SEQ 
15 ID NO:261 shown in Figure 179. 

Figure 181 shows a nucleotide sequence (SEQ ID NO:263) of a native sequence PROl 129 (UNQ568) 
cDNA, wherein SEQ ID NO:263 is a clone designated herein as "DNA59213-1487". 

Figure 1 82 shows the amino acid sequence (SEQ ID NO :264) derived from the coding sequence of SEQ 
ID NO: 263 shown in Figure 181. 
20 Figure 183 shows a nucleotide sequence (SEQ ID NO:265) of a native sequence PRO1068 (UNQ525) 

cDNA, wherein SEQ ID NO:265 is a clone designated herein as "DNA59214-1449". 

Figure 184 shows the arnino acid sequence (SEQ ID NO:266) derived from the coding sequence of SEQ 
ID NO: 265 shown in Figure 183. 

Figure 185 shows a nucleotide sequence (SEQ ID NO:267) of a native sequence PRO1066 (UNQ524) 
25 cDNA„ wherein SEQ ID NO:267 is a clone designated herein as "DNA59215-1425*. 

Figure 186 shows the amino acid sequence (SEQ ID NO:268) derived from the coding sequence of SEQ 
ID NO: 267 shown in Figure 185. 

Figure 187 shows a nucleotide sequence (SEQ ID NO:269) of a native sequence PROl 184 (UNQ598) 
cDNA, wherein SEQ ID NO;269 is a clone designated herein as "DNA59220-1514". 
30 Figure 188 shows the amino acid sequence (SEQ ID NO:270) derived from the coding sequence of SEQ 

ID NO:269 shown in Figure 187. 

Figure 189 shows a nucleotide sequence (SEQ ID NO:271) of a native sequence PRO 1360 (UNQ709) 
cDNA, wherein SEQ ID NO:271 is a clone designated herein as "DNA59488-1603". 

Figure 190 shows the amino acid sequence (SEQ ID NO:272) derived from the coding sequence of SEQ 
35 ID NO:271 shown in Figure 189. 

Figure 191 shows a nucleotide sequence (SEQ ID NO;273) of a native sequence PROI029 (UNQ514) 
cDNA, wherein SEQ ID NO:273 is a clone designated herein as "DNA59493-1420". 
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Figure 192 shows the amino acid sequence (SEQ ID NO:274) derived from the coding sequence of SEQ 
ID NO:273 shown in Figure 191. 

Figure 193 shows a nucleotide sequence (SEQ ID NO;275) of a native sequence PR01139 (UNQ577) 
cDNA, wherein SEQ ID NO:275 is a clone designated herein as "DNA59497-1496\ 

Figure 194 shows the amino acid sequence (SEQ ID NO:276) derived from the coding sequence of SEQ 
ID NO:275 shown in Figure 193. 

Figure 195 shows a nucleotide sequence (SEQ ID NO:277) of a native sequence PROI309 (UNQ675) 
cDNA, wherein SEQ ID NO:277 is a clone designated herein as "DNA59588-1571". 

Figure 196 shows the amino acid sequence (SEQ ID NO:278) derived from the coding sequence of SEQ 
ID NO:277 shown in Figure 195. 

Figure 197 shows a nucleotide sequence (SEQ ID NO:280) of a native sequence PRO 1028 (UNQ513) 
cDNA, wherein SEQ ID NO.280 is a clone designated herein as "DNA59603-1419". 

Figure 198 shows the amino acid sequence (SEQ ID NO:28l) derived from the coding sequence of SEQ 
ID NO:280 shown in Figure 197. 

Figure 199 shows a nucleotide sequence (SEQ ID NO:282) of a native sequence PRO1027 (UNQ512) 
cDNA, wherein SEQ ID NO:282 is a clone designated herein as "DNA59605-1418". 

Figure 200 shows the amino acid sequence (SEQ ID NO:283) derived from the coding sequence of SEQ 
ID NO:282 shown in Figure 199. 

Figure 201 shows a nucleotide sequence (SEQ ID NO:284) of a native sequence PROl 107 (UNQ550) 
cDNA, wherein SEQ ID NO:284 is a clone designated herein as "DNA59606-1471\ 

Figure 202 shows the amino acid sequence (SEQ ID NO:285) derived from the coding sequence of SEQ 
ID NO:284 shown in Figure 201. 

Figure 203 shows a nucleotide sequence (SEQ ID NO:286) of a native sequence PROl 140 (UNQ578) 
cDNA, wherein SEQ ID NO:286 is a clone designated herein as "DNA59607-1497". 

Figure 204 shows the amino acid sequence (SEQ ID NO: 287) derived from the coding sequence of SEQ 
ID NO:286 shown in Figure 203. 

Figure 205 shows a nucleotide sequence (SEQ ID NO:288) of a native sequence PROl 106 (UNQ549) 
cDNA, wherein SEQ ID NO:288 is a clone designated herein as "DNA59609-1470". 

Figure 206 shows the amino acid sequence (SEQ ID NO:289) derived from the coding sequence of SEQ 
ID NO: 288 shown in Figure 205. 

Figure 207 shows a nucleotide sequence (SEQ ID NO:290) of a native sequence PRO 1291 (UNQ659) 
cDNA, wherein SEQ ID NO:290 is a clone designated herein as "DNA59610-1556". 

Figure 208 shows the amino acid sequence (SEQ ID NO:291) derived from the coding sequence of SEQ 
ID NO:290 shown in Figure 207. 

Figure 209 shows a nucleotide sequence (SEQ ID NO:292) of a native sequence PRO1105 (UNQ548) 
cDNA, wherein SEQ ID NO:292 is a clone designated herein as "DNA59612-1466V 

Figure 210 shows the amino acid sequence (SEQ ID NO:293) derived from the coding sequence of SEQ 
ID NO;292 shown in Figure 209. 
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Figure 21 1 shows a nucleotide sequence (SEQ ID NO:294) of a naiive sequence PR051 1 (UNQ511) 
cDNA, wherein SEQ ID NO:294 is a clone designated herein as "DNA59613-14I7". 

Figure 212 shows the amino acid sequence (SEQ ID NO:295) derived from the coding sequence of SEQ 
ID NO:294 shown in Figure 211. 

Figure 213 shows a nucleotide sequence (SEQ ID NO:296) of a native sequence PROl 104 (UNQ547) 
5 cDNA, wherein SEQ ID NO:296 is a clone designated herein as "DNA59616-1465". 

Figure 2 14 shows the amino acid sequence (SEQ ID NO:297) derived from the coding sequence of SEQ 
ID NO:296 shown in Figure 213. 

Figure 215 shows a nucleotide sequence (SEQ ID NO:298) of a native sequence PROl 100 (UNQ546) 
cDNA, wherein SEQ ID NO:298 is a clone designated herein as "DNA59619-1464". 
10 Figure 216 shows the amino acid sequence (SEQ ID NO:299) derived from the coding sequence of SEQ 

ID NO:298 shown in Figure 215. 

Figure 217 shows a nucleotide sequence (SEQ ID NO:300) of a native sequence PR0836 (UNQ545) 
cDNA, wherein SEQ ID NO:300 is a clone designated herein as "DNA59620-1463". 

Figure 218 shows the amino acid sequence (SEQ ID NO:301) derived from the coding sequence of SEQ 
15 ID NO:300 shown in Figure 217. 

Figure 219 shows a nucleotide sequence (SEQ ID NO:302) of a native sequence PROl 141 (UNQ579) 
cDNA, wherein SEQ ID NO: 302 is a clone designated herein as "DNA59625-1498". 

Figure 220 shows the amino acid sequence (SEQ ID NO: 303) derived from the coding sequence of SEQ 
ID NO:302 shown in Figure 219. 
20 Figure 221 shows a nucleotide sequence designated herein as DNA33128 (SEQ ID NO:304). 

Figure 222 shows a nucleotide sequence designated herein as DNA34256 (SEQ ID NO:305). 
Figure 223 shows a nucleotide sequence designated herein as DNA47941 (SEQ ID NO:306). 
Figure 224 shows a nucleotide sequence designated herein as DNA54389 (SEQ ID NO:307). 
Figure 225 shows a nucleotide sequence (SEQ ID NO:308) of a native sequence PROl 132 (UNQ570) 
25 cDNA, wherein SEQ ID NO:308 is a clone designated herein as "DNA59767-1489". 

Figure 226 shows the amino acid sequence (SEQ ID NO: 309) derived from the coding sequence of SEQ 
ID NO:308 shown in Figure 225. 

Figure 227 shows a nucleotide sequence (SEQ ID NO:313) of a native sequence PRO 1346 cDNA, 
wherein SEQ ID NO:313 is a clone designated herein as "DNA5 9776- 1600". 
30 Figure 228 shows the amino acid sequence (SEQ ID NO:3 14) derived from the coding sequence of SEQ 

ID NO:313 shown in Figure 227. 

Figure 229 shows a nucleotide sequence (SEQ ID NO:318) of a native sequence PROl 131 (UNQ569) 
cDNA, wherein SEQ ID NO:318 is a clone designated herein as 'DNA59777-1480". 

Figure 230 shows the amino acid sequence (SEQ ID NO:319) derived from the coding sequence of SEQ 
35 ID NO:318 shown in Figure 229. 

Figure 231 shows a nucleotide sequence designated herein as DNA43546 (SEQ ID NO:320). 
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Figure 232 shows a nucleotide sequence (SEQ ID NO:325) of a native sequence PR01281 (UNQ651) 
cDNA, wherein SEQ ID N0.325 is a clone designated herein as "DNA5982(M549". 

Figure 233 shows the amino acid sequence (SEQ ID NO:326) derived from the coding sequence of SEQ 
ID NO:325 shown in Figure 232. 

Figure 234 shows a nucleotide sequence (SEQ ID NO:333) of a native sequence PRO 1064 (UNQIU) 
5 cDNA, wherein SEQ ID NO:333 is a clone designated herein as "DNA59827-1426\ 

Figure 235 shows the amino acid sequence (SEQ ID NO:334) derived from the coding sequence of SEQ 
ID NO:333 shown in Figure 234. 

Figure 236 shows a nucleotide sequence designated herein as DNA45288 (SEQ ID NO:335). 
Figure 237 shows a nucleotide sequence (SEQ ID NO:339) of a native sequence PR01379 (UNQ716) 
10 cDNA, wherein SEQ ID NO:339 is a clone designated herein as -DNA59828-1608". 

Figure 238 shows the amino acid sequence (SEQ ID NO:340) derived from the coding sequence of SEQ 
ID NO:339 shown in Figure 237. 

Figure 239 shows a nucleotide sequence (SEQ ID NO:344) of a native sequence PR0844 (UNQ544) 
cONA, wherein SEQ ID NO:344 is a clone designated herein as "DNA59838-1462". 
15 Figure 240 shows the amino acid sequence (SEQ ID NO:345) derived from the codmg sequence of SEQ 

ID NO: 344 shown in Figure 239. 

Figure 241 shows a nucleotide sequence (SEQ ID NO:346) of a native sequence PROMS (UNQ543) 
cDNA. wherein SEQ ID N0.346 is a clone designed herein as -DNA59839-1461 

Figure 242 shows the amino acid sequence (SEQ ID N0.347) derived from the coding sequence of SEQ 
20 ID NO:346 shown in Figure 241 . 

Figure 243 shows a nucleotide sequence (SEQ ID NO : 348) of a native sequence PRO1097 (UNQ542) 
cDNA. wherein SEQ ID NO:348 is a clone designated herein as -DNA5984M460-. 

Figure 244 shows the amino acid sequence (SEQ ID NO:349) derived from the coding sequence of SEQ 
ID NO:348 shown in Figure 243. 

25 Figure 245 shows a nucleotide sequence (SEQ ID NO:350) of a nauve sequence PROl 153 (UNQ583) 

cDNA. wherein SEQ ID NO:350 is a clone designated herein as "DNA59842- 1502V 

Figure 246 shows the amino acid sequence (SEQ ID NO:35.) derived from the coding sequence of SEQ 

ID NO:350 shown in Figure 245. 

Figure 247 shows a nuc.eo.ide sequence (SEQ ID NO:352, of a native sequence PRO. 154 (UNQ584) 
30 cDNA. wherein SEQ ID NO:352 is a clone designated herein as 'DNA59846-.503-. 

Figure 248 shows the amino acid sequence (SEQ ID NO:353) derived from the coding sequence of SEQ 
ID NO:352 shown in Figure 247. 



35 



Figure 249 shows a nuc.eotide sequence (SEQ ,D NO:354) of a native sequence PRO. 181 (UNQ595) 

cDNA. wherein SEQ ID NO:354 is a clone des.gnated herein as "DNA59847-151 1". 

Figure 250 shows the amino acid seouenrp fsvn m Mr* « tlJ 
Tn Mft k SCqUCnCe (SEQ ID NO:355 > dcriv ^d from the coding sequence of SEQ 

ID NO:354 shown in Figure 249. 
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Figure 251 shows a nucleotide sequence (SEQ ID NO:356) of a native sequence PROl 182 (UNQ596) 
cDNA, wherein SEQ ID NO:356 is a clone designated herein as "DNA59848-1512". 

Figure 252 shows the amino acid sequence (SEQ ID NO:357) derived from the coding sequence of SEQ 
ID NO:356 shown in Figure 251. 

Figure 253 shows a nucleotide sequence (SEQ ID NO:358) of a native sequence PROl 155 (UNQ585) 
5 cDNA, wherein SEQ ID NO:358 is a clone designated herein as *DNA59849-1504\ 

Figure 254 shows the amino acid sequence (SEQ ID NO:359) derived from the coding sequence of SEQ 
ID NO:358 shown in Figure 253. 

Figure 255 shows a nucleotide sequence (SEQ ID NO: 360) of a native sequence PROl 156 (UNQ586) 
cDNA, wherein SEQ ID NO:360 is a clone designated herein as "DNA59853-1505". 
10 Figure 256 shows the amino acid sequence (SEQ ID NO; 36 1) derived from the coding sequence of SEQ 

ID NO: 360 shown in Figure 255. 

Figure 257 shows a nucleotide sequence (SEQ ID NO:362) of a native sequence PRO1098 (UNQ541) 
cDNA, wherein SEQ ID NO:362 is a clone designated herein as "DNA59854-1459*. 

Figure 258 shows the amino acid sequence (SEQ ID NO:363) derived from the coding sequence of SEQ 
15 ID NO:362 shown in Figure 257. 

Figure 259 shows a nucleotide sequence (SEQ ID NO: 364) of a native sequence PROl 127 (UNQ565) 
cDNA, wherein SEQ ID NO:364 is a clone designated herein as 'DNA60283-1484V 

Figure 260 shows the amino acid sequence (SEQ ID NO: 365) derived from the coding sequence of SEQ 
ID NO: 364 shown in Figure 259. 
20 Figure 261 shows a nucleotide sequence (SEQ ID NO:366) of a native sequence PROl 126 (UNQ564) 

cDNA, wherein SEQ ID NO:366 is a clone designated herein as "DNA606I5-1483". 

Figure 262 shows the amino acid sequence (SEQ ID NO: 367) derived from the coding sequence of SEQ 
ID NO: 366 shown in Figure 261. 

Figure 263 shows a nucleotide sequence (SEQ ID NO:368) of a native sequence PRO 1125 (UNQ563) 
25 cDNA, wherein SEQ ID NO:368 is a clone designated herein as -DNA60619-1482V 

Figure 264 shows the amino acid sequence (SEQ ID NO:369) derived from the coding sequence of SEQ 
ID NO:368 shown in Figure 263. 

Figure 265 shows a nucleotide sequence (SEQ ID NO:370) of a native sequence PROH86 (UNQ600) 
cDNA, wherein SEQ ID NO:370 is a clone designated herein as "DNA6062 1-1516". 

Figure 266 shows the amino acid sequence (SEQ ID NO: 371) derived from the coding sequence of SEQ 
ID NO:370 shown in Figure 265. 

Figure 267 shows a nucleotide sequence (SEQ ID NO:372) of a native sequence PR01198 (UNQ611) 
cDNA, wherein SEQ ID N0.372 is a clone designated herein as "DNA60622-1525\ 

Figure 268 shows the amino acid sequence (SEQ ID NO:373) derived from the coding sequence of SEQ 
ID NO: 372 shown in Figure 267. 

Figure 269 shows a nucleotide sequence (SEQ ID NO:374) of a native sequence PROl 158 (UNQ588) 
cDNA, wherein SEQ ID NO:374 is a clone designated herein as "DNA60625-1507". 
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Figure 270 shows the amino acid sequence (SEQ ID N0.375) derived from the coding sequence of SEQ 
ID NO-.374 shown in Figure 269. 

Figure 271 shows a nucleotide sequence (SEQ ID iNO:376) of a native sequence PROl 159 (UNQ589) 
cDNA. wherein SEQ ID NO:376 is a clone designated herein as "DNA60627-1508". 

Figure 272 shows the amino acid sequence (SEQ ID NO: 377) derived from the coding sequence of SEQ 
5 ID NO:376 shown in Figure 271. 

Figure 273 shows a nucleotide sequence (SEQ ID NO:378) of a native sequence PRO 11 24 (UNQ562) 
cDNA, wherein SEQ ID NO;378 is a clone designated herein as "DNA60629-1481 ". 

Figure 274 shows the amino acid sequence (SEQ ID NO: 379) derived from the coding sequence of SEQ 
ID NO:378 shown in Figure 273. 
10 Figure 275 shows a nucleotide sequence (SEQ ID NO:380) of a native sequence PR01287 (UNQ656) 

cDNA, wherein SEQ ID NO:380 is a clone designated herein as 'DNA61755-1554V 

Figure 276 shows the amino acid sequence (SEQ ID N0.381) derived from the coding sequence of SEQ 
ID NO:380 shown in Figure 275. 

Figure 277 shows a nucleotide sequence (SEQ ID NO:386) of a native sequence PR01312 (UNQ678) 
15 cDNA, wherein SEQ ID NO:386 is a clone designated herein as "DNA61 873-' 1574". 

Figure 278 shows the amino acid sequence (SEQ ID NO:387) derived from the coding sequence of SEQ 
ID NO:386 shown in Figure 277. 

Figure 279 shows a nucleotide sequence (SEQ ID NO: 388) of a native sequence PROl 192 (UNQ606) 
cDNA, wherein SEQ ID NO:388 is a clone designated herein as "DNA62814-152I 
20 Figure 280 shows the amino acid sequence (SEQ ID NO: 389) derived from the coding sequence of SEQ 

ID NO:388 shown in Figure 279. 

Figure 281 shows a nucleotide sequence (SEQ ID NO:393) of a native sequence PRO1160 (UNQ590) 
cDNA, wherein SEQ ID NO:393 is a clone designated herein as "DNA62872-1509\ 

Figure 282 shows the amino acid sequence (SEQ ID NO:394) derived from the coding sequence of SEQ 
25 ID NO:393 shown in Figure 281. 

Figure 283 shows a nucleotide sequence (SEQ ID NO:398) of a native sequence PROl 187 (UNQ601) 
cDNA, wherein SEQ ID NO:398 is a clone designated herein as *DNA62876-1517\ 

Figure 284 shows the amino acid sequence (SEQ ID NO:399) derived from the coding sequence of SEQ 
ID NO:398 shown in Figure 283. 
30 Figure 285 shows a nucleotide sequence (SEQ ID NO:400) of a native sequence PRO 1 1 85 (UNQ599) 

cDNA, wherein SEQ ID NO:400 is a clone designated herein as "DNA6288M515\ 

Figure 286 shows the amino acid sequence (SEQ ID NO:401 ) derived from the coding sequence of SEQ 
ID NO:400 shown in Figure 285. 

Figure 287 shows a nucleotide sequence (SEQ ID NO:402) of a native sequence PROl 345 (UNQ700) 
35 cDNA, wherein SEQ ID NO:402 is a clone designated herein as "DNA64852- 1589V 

Figure 288 shows the amino acid sequence (SEQ ID NO:403) derived from the coding sequence of SEQ 
ID NO:402 shown in Figure 287. 
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Figure 289 shows a nucleotide sequence (SEQ ID NO;407) of a native sequence PR01245 (UNQ629) 
cDNA. wherein SEQ ID NO:407 is a clone designated herein as "DNA64884-1527V 

Figure 290 shows the amino acid sequence (SEQ ID NO :408) denved from the coding sequence of SEQ 
ID NO: 407 shown in Figure 289. 

Figure 291 shows a nucleotide sequence (SEQ ID NO:409) of a native sequence PR01358 (UNQ707) 
5 cDNA, wherein SEQ ID NO:409 is a clone designated herein as "DNA64890-I612V 

Figure 292 shows the amino acid sequence (SEQ ID NO:410) derived from the coding sequence of SEQ 
ID NO:409 shown in Figure 291. 

Figure 293 shows a nucleotide sequence (SEQ ID NO:41 1) of a native sequence PRO 11 95 (UNQ608) 
cDNA, wherein SEQ ID NO:411 is a clone designated herein as "DNA65412-1523". 
10 Figure 294 shows the amino acid sequence (SEQ ID NO :4 12) derived from the coding sequence of SEQ 

ID NO:41 1 shown in Figure 293. 

Figure 295 shows a nucleotide sequence (SEQ ID NO:413) of a native sequence PRO 1270 (UNQ640) 
cDNA, wherein SEQ ID N0:413 is a clone designated herein as "DNA66308-1537". 

Figure 296 shows the amino acid sequence (SEQ ID NO:414) derived from the coding sequence of SEQ 
15 ID N0:413 shown in Figure 295. 

Figure 297 shows a nucleotide sequence (SEQ ID NO:4l5) of a native sequence PRO 1271 (UNQ641) 
cDNA, wherein SEQ ID NO:415 is a clone designated herein as "DNA66309-1538". 

Figure 298 shows the amino acid sequence (SEQ ID NO:4 16) derived from the coding sequence of SEQ 
ID NO:415 shown in Figure 297. 
20 Figure 299 shows a nucleotide sequence (SEQ ID NO:4l7) of a native sequence PRO 1375 (UNQ712) 

cDNA, wherein SEQ ID N0:417 is a clone designated herein as "DNA67004-1614\ 

Figure 300 shows the amino acid sequence (SEQ ID NO:418) derived from the coding sequence of SEQ 
ID NO:417 shown in Figure 299. 

Figure 301 shows a nucleotide sequence (SEQ ID N0:419) of a native sequence PR01385 (UNQ720) 
25 cDNA, wherein SEQ ID N0:419 is a clone designated herein as "DNA68869-1610". 

Figure 302 shows the amino acid sequence (SEQ ID NO:420) derived from the coding sequence of SEQ 
ID NO:419 shown in Figure 301. 

Figure 303 shows a nucleotide sequence (SEQ ID NO:421) of a native sequence PR01387 (UNQ722) 
cDNA, wherein SEQ ID NO:421 is a clone designated herein as 'DNA68872-1620". 
30 Figure 304 shows the amino acid sequence (SEQ ID NO:422) derived from the coding sequence of SEQ 

ID NO:421 shown in Figure 303. 

Figure 305 shows a nucleotide sequence (SEQ ID NO:423) of a native sequence PRO 1384 (UNQ721) 
cDNA, wherein SEQ ID NO:423 is a clone designated herein as "DNA71 159-1617". 

Figure 306 shows the amino acid sequence (SEQ ID NO:424) derived from the coding sequence of SEQ 
35 ID NO:423 shown in Figure 305. 
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DETAILED DESCRIPTION OF THE PREFERRE D EMBODIMENTS 
I. Definitions 

The terms "PRO polypeptide* and "PRO" as used herein and when immediately followed by a 
numerical designation refer to various polypeptides, wherein the complete designation (i.e. , PRO/mimber) refers 
to specific polypeptide sequences as described herein. The terms "PRO/number polypeptide" and 
5 "PRO/number" wherein the term "number' is provided an actual numerical designation as used herein 
encompass native sequence polypeptides and polypeptide variants (which are further defined herein). The PRO 
polypeptides described herein may be isolated from a variety of sources, such as from human tissue types or 
from another source, or prepared by recombinant or synthetic methods. 

A "native sequence PRO polypeptide" comprises a polypeptide having the same amino acid sequence 

10 as the corresponding PRO polypeptide derived from nature. Such native sequence PRO polypeptides can be 
isolated from nature or can be produced by recombinant or synthetic means. The term "native sequence PRO 
polypeptide" specifically encompasses naturally -occurring truncated or secreted forms of the specific PRO 
polypeptide (e.g., an extracellular domain sequence), iiarurally-occurring variant forms (e.g., alternatively 
spliced forms) and narurally-occurring allelic variants of the polypeptide. In various embodiments of the 

15 invention, the native sequence PR0281 polypeptide is a mature or full-length native sequence PR0281 
polypeptide comprising amino acids 1 to 345 of Figure 2 (SEQ ID N0:2), the native sequence PR0276 is a full- 
length or mature native sequence PR0276 comprising amino acids 1 through 251 of Figure 4 (SEQ ID NO:6), 
the native sequence PRO 189 is a full-length or mature native sequence PRO 189 comprising arnino acids 1 
through 367 of Figure 6 (SEQ ID NO:8), the native sequence PRO190 polypeptide is a full-length or mature 

20 native sequence PRO 190 polypeptide comprising amino acids 1 through 424 of Figure 9 (SEQ ID NO: 14), the 
native sequence PR0341 is a mature or full-length native sequence PR0341 comprising amino acids 1 to 458 
of Figure 12 (SEQ ID NO:20), the native sequence PRO180 is a full-length or mature native sequence PRO180 
comprising amino acids 1 through 266 of Figure 15 (SEQ ID NO:23), the native sequence PR0194 polypeptide 
is a mature or full-length native sequence PRO 194 polypeptide comprising amino acids 1 to 264 of Figure 18 

25 (SEQ ID NO:28), the native sequence PRO203 polypeptide is a mature or full-length native sequence PRO203 
polypeptide comprising amino acids I to 347 of Figure 20 (SEQ ID NO: 30), the native sequence PRO290 is a 
full-length or mature native sequence PRO290 comprising amino acids 1 through 1003 of Figure 23 (SEQ ID 
NO: 33), the native sequence PR0874 polypeptide comprises amino acids 1 to 321 of Figure 25 (SEQ ID 
NO: 36), the native sequence PR07 10 polypeptide is a mature or full-length native sequence PR07 10 polypeptide 

30 comprising amino acids 1 to 566 of Figure 27 (SEQ ID NO:41), the native sequence PROU51 is a mature or 
full-length native sequence PROl 151 comprising amino acids \ to 259 of Figure 30 (SEQ ID NO:47), the native 
sequence PRO 1282 is a full-length or mature native sequence PROl 282 comprising amino acids 1 or about 24 
through 673 of Figure 32 (SEQ ID NO:52), the native sequence PR0358 is a mature or full-length native 
sequence PR0358 polypeptide comprising amino acids 1 to 811 of Figure 34 (SEQ ID NO:57), the native 

35 sequence PR013 10 is a full-length or mature native sequence PRO 1310 comprising amino acids 1 through 765 
of Figure 36 (SEQ ID NO:62), the native sequence PR0698 polypeptide is a mature or full-length native 
sequence PR0698 polypeptide comprising amino acids 1 to 510 of Figure 38 (SEQ ID N0:67), the native 
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sequence PR0732 polypeptide is a marure or full-length native sequence PR0732 polypeptide comprising amino 
acids 1 to 453 of Figure 41 (SEQ ID NO: 73). the native sequence PROl 120 is a full-length or mature native 
sequence PROI120 comprising amino acids 1 or about 18 to 867 of Figure 47 (SEQ ID NO:84), the native 
sequence PR0537 is a marure or full-length native sequence PR0537 comprising amino acids 1 to 1 15 of Figure 
49 (SEQ ID NO:95), the native sequence PR0536 is a mature or full-length native sequence PR0536 comprising 
5 amino acids 1 to 313 of Figure 51 (SEQ ID NO:97), the native sequence PR0535 is a marure or full-length 
native sequence PR0535 comprising amino acids 1 to 201 of Figure 53 (SEQ ID NO:99), the native sequence 
PR0718 polypeptide is a marure or full-length native sequence PR0718 polypeptide comprising amino acids 1 
to 1 57 of Figure 57 (SEQ ID NO: 103), the native sequence PR0872 polypeptide is a mature or full-length native 
sequence PR0872 polypeptide comprising amino acids 1 to 610 of Figure 64 (SEQ ID NO: 113), the native 
10 sequence PRO1063 polypeptide is a marure or full-length native sequence PRO 1063 polypeptide comprising 
amino acids 1 to 301 of Figure 66 (SEQ ID NO: 115), the native sequence PR0619 is a full-length or mature 
native sequence PR0619 comprising amino acids 1 or about 21 through 123 of Figure 68 (SEQ ID NO:l 17), 
the native sequence PR0943 is a mature or full-length native sequence PR0943 comprising amino acids 1 to 504 
of Figure 70 (SEQ ID NO: 119), the native sequence PROl 188 is a full-length or mature native sequence 
15 PROl 188 comprising amino acids 1 or about 22 to 1 184 of Figure 72 (SEQ ID NO: 124), the native sequence 
PROl 133 is a full-length or mature native sequence PROl 133 comprising amino acids 1 or about 19 through 
438 of Figure 74 (SEQ ID NO: 129), the native sequence PR0784 is a mature or full-length native sequence 
PR0784 comprising amino acids 16 to 228 of Figure 76 (SEQ ID NO: 135), the native sequence PR0783 
polypeptide is a mature or full-length native sequence PR0783 polypeptide comprising amino acids 1 to 489 of 
20 Figure 79 (SEQ ID NO: 138), the native sequence PRO820 is a full-length or mature native sequence PRO820 
comprising amino acids 1 or 16 through 124 of Figure 83 (SEQ ID NO: 146), the native sequence PRO1080 is 
a full-length or mature native sequence PRO1080 comprising amino acids 1 or 23 through 358 of Figure 85 (SEQ 
ID NO: 148), the native sequence PRO1079 is a full-length or mature native sequence PRO1079 comprising 
amino acids 1 or about 30 to 226 of Figure 88 (SEQ ID NO: 151), the native sequence PR0793 is a mature or 
25 full-length native sequence PR0793 comprising amino acids 1 to 1 38 of Figure 90 (SEQ ID NO: 153), the native 
sequence PRO1016 is a full-length or mature native sequence PRO101 6 comprising amino acids 1 or 19 through 
378 of Figure 93 (SEQ ID NO: 156), the native sequence PRO1013 polypeptide is a full-length or mature native 
sequence PRO1013 polypeptide comprising amino acids 1 or 20 through 409 of Figure 95 (SEQ ID NO: 158), 
the native sequence PR0937 polypeptide is a mature or full-length native sequence PR0937 polypeptide 
30 comprising amino acids 1 to 556 of Figure 97 (SEQ ID NO: 160), the native sequence PR0842 is a full-length 
or marure native sequence PR0842 comprising amino acids 1 or about 23 to 119 of Figure 99 (SEQ ID 
NO: 165), the native sequence PR0839 is a full-length or marure native sequence PR0839 comprising amino 
acids 1 or about 24 to 87 of Figure 101 (SEQ ID NO: 167), the native sequence PROl 180 polypeptide is a mature 
or full-length native sequence PROl 180 polypeptide comprising amino acids 1 to 277 of Figure 103 (SEQ ID 
35 NO : 1 69) , the native sequence PRO 1 134 is a mature or full-length native sequence PRO 1 1 34 comprising amino 
acids 1 to 371 of Figure 105 (SEQ ID NO: 171), the native sequence PRO830 is a mature or full-length native 
sequence PRO830 comprising amino acids 1 to 87 of Figure 109 (SEQ ID NO: 175), the native sequence 
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PROl 1 15 is a full-length or mature native sequence PROl 1 15 comprising amino acids 1 or about 21 to 445 of 
Figure 1 1 1 (SEQ ID NO: 177), the native sequence PRO 1277 is a full-length or mature native sequence PRO 1277 
comprising amino acids 1 or about 27 to 678 of Figure 113 (SEQ ID NO: 179), the native sequence PROl 135 
polypeptide is a mature or full-length native sequence PRO 1 135 polypeptide comprising amino acids 1 to 541 
of Figure 115 (SEQ ID NO: 181), the native sequence PROl 114 interferon receptor is a mature or full-length 
native sequence PROl 1 14 interferon receptor comprising amino acids 1 to 3 11 of Figure 118 (SEQ ID NO: 184), 
the native sequence PR0828 polypeptide is a mature or full-length native sequence PR0828 polypeptide 
comprising amino acids 1 to 187 of Figure 120 (SEQ ID NO: 189), the native sequence PRO1009 is a full-length 
or mature native sequence PRO 1009 comprising amino acids 1 or 23 to 615 of Figure 122 (SEQ ID NO: 194), 
the native sequence PRO1007 polypeptide is a full-length or mature native sequence PRO1007 polypeptide 
comprising amino acids 1 or 31 through 346 of Figure 125 (SEQ ID NO: 197), the native sequence PRO 1056 
is a mature or full-length native sequence PRO 1056 comprising amino acids 1 to 120 of Figure 127 (SEQ ID 
NO: 199), the native sequence PR0826 is a mature or full-length native sequence PR0826 comprising amino 
acids 1 to 99 of Figure 129 (SEQ ID NO:201), the native sequence PR0819 is a mature or full-length native 
sequence PR0819 comprising amino acids 1 to 52 of Figure 131 (SEQ ID NO:203), the native sequence 
PRO1006 is a full-length or mature native sequence PRO1006 comprising amino acids 1 or 24 through 392 of 
Figure 133 (SEQ ID NO:205), the native sequence PROl 112 polypeptide is a full-length or mature native 
sequence PROl 112 polypeptide comprising amino acids 1 or 14 through 262 of Figure 135 (SEQ ID NO:207), 
the native sequence PRO1074 polypeptide is a mature or full-length native sequence PRO1074 polypeptide 
comprising amino acids 1 to 331 of Figure 137 (SEQ ID NO:209), the native sequence PRO1005 is a full-length 
or mature native sequence PRO 1005 comprising amino acids 1 or about 21 to 185 of Figure 139 (SEQ ID 
NO:21 1), the native sequence PRO1073 is a full-length or mature native sequence PRO1073 comprising amino 
acids 1 or about 32 to 299 of Figure 141 (SEQ ID N0:213), the native sequence PROH52 is a mature or full- 
length native sequence PROl 152 comprising amino acids 1 to 479 of Figure 144 (SEQ ID NO:216), the native 
sequence PRO 1 136 is a mature or full-length native sequence PROl 136 comprising arnino acids I to 632 of 
Figure 147 (SEQ ID NO:219), the native sequence PR0813 polypeptide is a mature or full-length native 
sequence PR0813 polypeptide comprising arnino acids 1 to 76 of Figure 149 (SEQ ID NO:221), the native 
sequence PRO809 is a full-length or mature native sequence PRO809 comprising amino acids 1 or 19 through 
265 of Figure 151 (SEQ ID NO:223), the native sequence PR0791 is a full-length or mature native sequence 
PR0791 comprising amino acids 1 or 26 through 246 of Figure 153 (SEQ ID N0:225), the native sequence 
PRO1004 is a full-length or mature native sequence PRO1004 comprising amino acids 1 or about 25 through 
1 15 of Figure 155 (SEQ ID NO:227), the native sequence PROl 1 11 is a full-length or mature native sequence 
PROl 1 1 1 comprising amino acids 1 through 653 of Figure 157 (SEQ ID NO:229), the native sequence PR01344 
is a mature or full-length native sequence PR0I344 comprising amino acids 1 to 720 of Figure 159 (SEQ ID 
NO:231), the native sequence PROl 109 is a mature or full-length native sequence PROl 109 comprising amino 
acids 1 to 344 of Figure 161 (SEQ ID NO:236), the native sequence PR01383 is a mature or full-length native 
sequence PR01383 comprising amino acids 1 to 423 of Figure 163 (SEQ ID NO:24I), the native sequence 
PRO1003 polypeptide is a mature or full-length native sequence PRO1003 polypeptide comprising amino acids 
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1 to 84 of Figure 165 (SEQ ID NO:246), the native sequence PROl 108 polypeptide is a mature or full-length 
native sequence PROl 108 polypeptide comprising amino acids 1 to 456 of Figure 167 (SEQ ID NO;248), the 
native sequence PROl 137 polypeptide is a mature or full-length native sequence PROl 137 polypeptide 
comprising amino acids 1 to 240 of Figure 169 (SEQ ID NO:250), the native sequence PROl 138 polypeptide 
is a mature or full-length native sequence PROl 138 polypeptide comprising amino acids 1 to 335 of Figure 171 
(SEQ ID NO:253), the native sequence PRO1054 is a mature or full-length native sequence PRO1054 comprising 
amino acids 1 to 180 of Figure 174 (SEQ ID NO:256), the native sequence PR0994 is a mature or full-length 
native sequence PR0994 comprising amino acids 1 to 229 of Figure 176 (SEQ ID NO:258). the native sequence 
PR0812 is a mature or full-length native sequence PR0812 comprising amino acids 1 to 83 of Figure 178 (SEQ 
ID NO:260), the native sequence PRO1069 polypeptide is a mature or full-length native sequence PRO1069 
polypeptide comprising amino acids 1 to 89 of Figure 180 (SEQ ID NO:262), the native sequence PROl 129 
polypeptide is a mature or full-length native sequence PROl 129 polypeptide comprising amino acids 1 to 524 
of Figure 182 (SEQ ID NO:264), the native sequence PRO 1068 is a full-length or mature native sequence 
PRO 1068 comprising amino acids 1 or about 21 to 124 of Figure 184 (SEQ ID NO;266), the native sequence 
PRO 1066 polypeptide is a mature or full-length native sequence PRO 1066 polypeptide comprising amino acids 
1 to 1 17 of Figure 186 (SEQ ID NO:268), the native sequence PRO! 184 polypeptide is a full-length or mature 
native sequence PROl 184 polypeptide comprising amino acids 1 or 39 through 142 of Figure 188 (SEQ ID 
NO:270), the native sequence PRO1360 is a full-length or mature native sequence PRO1360 comprising amino 
acids 1 or about 30 through 285 of Figure 190 (SEQ ID NO:272), the native sequence PRO1029 is a mature or 
full-length native sequence PRO1029 comprising amino acids 1 to 86 of Figure 192 (SEQ ID NO:274), the 
native sequence PROl 139 is a mature or full-length native sequence PRO 11 39 polypeptide comprising amino 
acids 1 to l3lor 29-131 of Figure 194 (SEQ ID NO:276), the native sequence PRO1309 is a full-length or 
mature native sequence PRO1309 comprising amino acids 1 or about 35 through 522 of Figure 196 (SEQ ID 
NO;278), the native sequence PRO1028 polypeptide is a full-length or mature native sequence PRO1028 
polypeptide comprising amino acids 1 or 20 through 197 of Figure 198 (SEQ ID NO:281), the native sequence 
PRO1027 is a full-length or mature native sequence PRO1027 comprising amino acids 1 or 34 through 77 of 
Figure 200 (SEQ ID NO:283) t the native sequence PROl 107 polypeptide is a full-length or mature native 
sequence PROl 107 polypeptide comprising amino acids 1 or 23 through 477 of Figure 202 (SEQ ID NO:285). 
the native sequence PROl 140 polypeptide is a mature or full-length native sequence PROl 140 polypeptide 
comprising amino acids 1 to 255 of Figure 204 (SEQ ID NO:287), the native sequence PROl 106 polypeptide 
is a full-length or mature native sequence PROl 106 polypeptide comprising amino acids 1 or 17 through 469 
of Figure 206 (SEQ ID NO:289), the native sequence PR0129I is a mature or full-length native sequence 
PR01291 comprising amino acids 1 to 282 of Figure 208 (SEQ ID NO:291), the native sequence PROl 105 
polypeptide is a full-length or mature native sequence PRO 11 05 polypeptide comprising amino acids 1 or 20 
through 180 of Figure 210 (SEQ ID NO:293) ( the native sequence PRO1026 is a full-length or mature native 
sequence PRO1026 comprising amino acids 1 or 26 through 237 of Figure 212 (SEQ ID NO:295) t the native 
sequence PROl 104 is a full-length or mature native sequence PROl 104 comprising amino acids 1 or about 23 
through 341 of Figure 214 (SEQ ID NO:297). the native sequence PROl 100 is a full-length or mature native 
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sequence PRO1100 comprising amino acids 1 or 21 through 320 of Figure 216 (SEQ tD NO:299), the native 
sequence PR0836 is a full- length or mature native sequence PR0836 comprising amino acids 1 or 30 through 
461 of Figure 218 (SEQ ID NO:301), the native sequence PROl 141 is a mature or full-length native sequence 
PR01141 comprising amino acids 1 to 247 of Figure 220 (SEQ ID NO:303), the native sequence PROl 132 is 
a full-length or mature native sequence PROl 132 comprising amino acids 1 or about 23 through 293 of Figure 
5 226 (SEQ ID NO:309), the native sequence NL7 is a mature or full-length native sequence NL7 comprising 
amino acids from about position 5 1 to about position 461 of Figure 228 (SEQ ID NO:3 14), the native sequence 
PROl 131 is a full-length or mature native sequence PROl 131 comprising amino acids 1 through 280 of Figure 
230 (SEQ ID NO:319), the native sequence PR01281 is a full-length or mature native sequence PR01281 
comprising amino acids 1 or about 16 to 775 of Figure 233 (SEQ ID N0:326), the native sequence PRO 1064 
10 is a mature or full-length native sequence PRO1064 comprising amino acids 1 to 153 of Figure 235 (SEQ ID 
NO:334), the native sequence PR01379 is a full-length or mature native sequence PR01379 comprising amino 
acids 1 or about 18 to 574 of Figure 238 (SEQ ID NO:340), the native sequence PR0844 is a full-length or 
mature native sequence PR0844 comprising amino acids 1 or 20 through 111 of Figure 240 (SEQ ID NO: 344), 
the native sequence PR0848 is a full-length or mature native sequence PR0848 comprising amino acids 1 or 36 
15 through 600 of Figure 242 (SEQ ID NO:347), the native sequence PRO 1097 is a full-length or mature native 
sequence PRO1097 comprising amino acids 1 or 21 through 91 of Figure 244 (SEQ ID NO:349), the native 
sequence PROl 153 is a mature or full-length native sequence PROl 153 comprising amino acids 1 to 197 of 
Figure 246 (SEQ ID NO:35 1), the native sequence PROl 154 is a full-length or mature native sequence PR01154 
comprising amino acids 1 or 35 to 941 of Figure 248 (SEQ ID N0:353), the native sequence PROU81 is a 
20 mature or full-length native sequence PRO 1181 comprising amino acids 1 to 437 of Figure 250 (SEQ ID 
NO:355), the native sequence PROl 182 is a mature or full-length native sequence PROl 182 comprising amino 
acids 1 to 271 of Figure 252 (SEQ ID NO:357), the native sequence PROl 155 is a full-length native or mature 
sequence PROl 155 comprising amino acids 1 or 19 through 135 of Figure 254 (SEQ ID NO:359), the native 
sequence PROl 156 is a full-length or mature native sequence PROl 156 comprising amino acids 1 or about 23 
25 to 159ofFigure256(SEQIDNO:361), me iiadve sequence PRO 1098 is a full^^ 

PRO1098 comprising amino acids 1 or 20 through 78 of Figure 258 (SEQ ID NO:363), the native sequence 
PROl 127 is a full-length or mature native sequence PROl 127 comprising amino acids 1 or about 30 through 
67 of Figure 260 (SEQ ID NO:365), the native sequence PROl 126 is a mature or full-length native sequence 
PRO 11 26 comprising amino acids 1 to 402 of Figure 262 (SEQ ID NO:367), the native sequence PROl 125 is 
30 a mature or full-length native sequence PROl 125 comprising amino acids 26 to 447 of Figure 264 (SEQ ID 
NO:369), the native sequence PROl 186 is a full-length or mature native sequence PROl 186 comprising amino 
acids 1 or about 20 through 105 of Figure 266 (SEQ ID NO:371), the native sequence PROl 198 is a full-length 
or mature native sequence PROl 198 comprising amino acids I or about 35 to 229 of Figure 268 (SEQ ID 
NO:373), the native sequence PROl 158 is a full-length or mature native sequence PROl 158 comprising amino 
acids 1 or about 20 to 123 of Figure 270 (SEQ ID NO:375), the native sequence PROl 159 is a mature or full- 
length native sequence PROl 159 comprising ammo acids 1 to 90 of Figure 272 (SEQ ID NO:377), the native 
sequence PROl 124 is a mature or full-length native sequence PROl 124 comprising amino acids 22 through 919 
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of Figure 274 (SEQ ID NO:379), the native sequence PRO 1287 is a mature or full-length native sequence 
PRO 1287 comprising amino acids 1 to 532 of Figure 276 (SEQ ID NO:381), the native sequence PR01312 is 
a full-length or mature native sequence PR01312 comprising amino acids 1 or about 15 to 212 of Figure 278 
(SEQ ID NO:387), the native sequence PROl 192 is a full-length or mature native sequence PROl 192 comprising 
amino acids 1 or about 22 to 2L5 of Figure 280 (SEQ ID NO:389), the native sequence PROl 160 is a mature 
5 or full-length native sequence PROl 160 comprising amino acids 1 to 90 of Figure 282 (SEQ ID NO:394), the 
native sequence PROl 187 is a full-length or mature native sequence PROl 187 comprising amino acids 1 or about 
18 through 120 of Figure 284 (SEQ ID NO: 399), the native sequence PROl 185 is a full-length or mature native 
sequence PRO 11 85 comprising amino acids 1 or about 22 through 198 of Figure 286 (SEQ ID NO:40l), the 
native sequence PRO 1345 is a mature or full-length native sequence PRO 1 345 comprising amino acids 1 to 206 
10 of Figure 288 (SEQ ID NO:403) t the native sequence PR01245 is a full-length or mature native sequence 
PR01245 comprising amino acids 1 or about 19 to 104 of Figure 290 (SEQ ID NO:408), the native sequence 
PRO 1 358 is a full-length or mature native sequence PR01358 comprising amino acids 1 or about 19 through 
444 of Figure 292 (SEQ ID NO:410), the native sequence PRO 1 195 is a full-length or mature native sequence 
PROl 195 comprising amino acids 1 or about 23 through 151 of Figure 294 (SEQ ID NO:412), the native 
15 sequence PRO1270 is a mature or full-length native sequence PRO1270 comprising amino acids 1 to 313 of 
Figure 296 (SEQ ID NO;414), the native sequence PR01271 is a mature or full-length native sequence PROI271 
comprising amino acids 1 to 208 of Figure 298 (SEQ ID NO:416), the native sequence PR01375 is a full-length 
or mature native sequence PRO 1375 comprising amino acids 1 through 198 of Figure 300 (SEQ ID NO:418), 
the native sequence PRO 1385 is a mature or full-length native sequence PRO 1 385 comprising amino acids 1 to 
20 128 of Figure302 (SEQ ID NO:420), the native sequence PR01387 is a mature or full-length native sequence 
PR01387 comprising amino acids 1 to 394 of Figure 304 (SEQ ID N0:422) and the native sequence PR01384 
is a full-length or mature native sequence PR01384 comprising amino acids 1 to 229 of Figure 306 (SEQ ID 
NO:424). Start and stop codons are shown in bold font and underlined in the figures. 

The PRO polypeptide "extracellular domain" or "ECD" refers to a form of the PRO polypeptide which 
25 is essentially free of the transmembrane and cytoplasmic domains . Ordinarily, a PRO polypeptide ECD will have 
less than 1 % of such transmembrane and/or cytoplasmic domains and preferably, will have less than 0.5% of 
such domains. It will be understood thai any transmembrane domains identified for the PRO polypeptides of 
the present invention are identified pursuant to criteria routinely employed in the art for identifying that type of 
hydrophobic domain. The exact boundaries of a transmembrane domain may vary but most likely by no more 
30 than about 5 amino acids at either end of the domain as initially identified . Optionally, therefore, an extracellular 
domain of a PRO polypeptide may contain from about 5 or fewer amino acids on either or the transmembrane 
domain as initially identified. 

"PRO polypeptide variant' means an active PRO polypeptide as defined above or below having at least 
about 80% amino acid sequence identity with a full-length native sequence PRO polypeptide sequence as 
55 disclosed herein, a full-length native sequence PRO polypeptide sequence lacking the signal peptide as disclosed 
herein, an extracellular domain of a PRO polypeptide as disclosed herein or any other fragment of a full-length 
PRO polypeptide sequence as disclosed herein. Such PRO polypeptide variants include, for instance, PRO 
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polypeptides wherein one or more amino acid residues are added, or deleted, at the N- or C-tenninus of the full- 
length native amino acid sequence. Ordinarily, a PRO polypeptide variant will have at least about 80% amino 
acid sequence identity, preferably at least about 8 1 % amino acid sequence identity, more preferably at least about 
82% amino acid sequence identity, more preferably at least about 83% amino acid sequence identity, more 
preferably at least about 84% amino acid sequence identity, more preferably at least about 85% amino acid 

5 sequence identity, more preferably at least about 86% amino acid sequence identity, more preferably at least 
about 87% amino acid sequence identity, more preferably at least about 88% amino acid sequence identity, more 
preferably at least about 89% amino acid sequence identity, more preferably at least about 90% amino acid 
sequence identity, more preferably at least about 91% amino acid sequence identify, more preferably at least 
about 92% amino acid sequence identity, more preferably at least about 93% amino acid sequence identity, more 

10 preferably at least about 94% amino acid sequence identity, more preferably at least about 95% amino acid 
sequence identity, more preferably at least about 96% amino acid sequence identity, more preferably at least 
about 97% amino acid sequence identity, more preferably at least about 98% amino acid sequence identity and 
most preferably at least about 99% amino acid sequence identity with the amino acid sequence of the full-length 
native amino acid sequence as disclosed herein. Ordinarily, PRO variant polypeptides are at least about 10 

15 amino acids in length, often at least about 20 amino acids in length, more often at least about 30 amino acids in 
length, more often at least about 40 amino acids in length, more often at least about 50 amino acids in length, 
more often at least about 60 amino acids in length, more often at least about 70 amino acids in length, more often 
at least about 80 amino acids in length, more often at least about 90 amino acids in length, more often at least 
about 100 amino acids in length, more often at least about 150 amino acids in length, moTe often at least about 

20 200 amino acids in length, more often at least about 300 amino acids in length, or more. 

"Percent (%) amino acid sequence identity" with respect to the PRO polypeptide sequences identified 
herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the 
ammo acid residues in the specific PRO polypeptide sequence, after aligning the sequences and introducing gaps, 
if necessary, to achieve the maximum percent sequence identity, and not considering any conservative 

25 substitutions as part of the sequence identity. Alignment for purposes of determining percent amino acid 
sequence identity can be achieved in various ways that are within the skill in the an, for instance, using publicly 
available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software. Those 
skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed 
to achieve maximal alignment over the full length of the sequences being compared. For purposes herein, 

30 however, % amino acid sequence identity values are generated using the WU-BLAST-2 computer program 
(Altschul et al., Methods in Enzvroology 266:460480 (1996)). Most of the WU-BLAST-2 search parameters 
are set to the default values. Those not set to default values, i.e., the adjustable parameters, are set with the 
following values: overlap span = 1, overlap fraction = 0.125, word threshold (T) - U, and scoring matrix 
= BLOSUM62. For purposes herein, a % amino acid sequence identity value is determined by dividing (a) the 

35 number of matching identical amino acid residues between the amino acid sequence of the PRO polypeptide of 
interest having a sequence derived from the native PRO polypeptide and the comparison amino acid sequence 
of interest (i.e., the sequence against which the PRO polypeptide of interest is being compared which may be 
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a PRO variant polypeptide) as determined by WU-BLAST-2 by (b) the total number of amino acid residues of 
the PRO polypeptide of interest. 

"PRO variant polynucleotide" or "PRO variant nucleic acid sequence" means a nucleic acid molecule 
which encodes an active PRO polypeptide as defined below and which has at least about 80% nucleic acid 
sequence identity with a nucleotide acid sequence encoding a full-length native sequence PRO polypeptide 
5 sequence as disclosed herein, a full-length native sequence PRO polypeptide sequence lacking the signal peptide 
as disclosed herein, an extracellular domain of a PRO polypeptide as disclosed herein or any other fragment of 
a full-length PRO polypeptide sequence as disclosed herein. Ordinarily, a PRO variant polynucleotide will have 
at least about 80% nucleic acid sequence identity, more preferably at least about 81% nucleic acid sequence 
identity, more preferably at least about 82% nucleic acid sequence identity, more preferably at least about 83% 
10 nucleic acid sequence identity, more preferably at least about 84% nucleic acid sequence identity, more 
preferably at least about 85% nucleic acid sequence identity, more preferably at least about 86% nucleic acid 
sequence identity, more preferably at least about 87% nucleic acid sequence identity, more preferably at least 
about 38% nucleic acid sequence identity, more preferably at least about 89% nucleic acid sequence identity, 
more preferably at least about 90% nucleic acid sequence identity, more preferably at least about 91 % nucleic 
15 acid sequence identity, more preferably at least about 92% nucleic acid sequence identity, more preferably at 
least about 93% nucleic acid sequence identity, more preferably at least about 94% nucleic acid sequence 
identity, more preferably at least about 95 % nucleic acid sequence identity, more preferably at least about 96% 
nucleic acid sequence identity, more preferably at least about 97% nucleic acid sequence identity, more 
preferably at least about 98% nucleic acid sequence identity and yet more preferably at least about 99% nucleic 
20 acid sequence identity with the nucleic acid sequence encoding a full-length native sequence PRO polypeptide 
sequence as disclosed herein, a full-length native sequence PRO polypeptide sequence lacking the signal peptide 
as disclosed herein, an extracellular domain of a PRO polypeptide as disclosed herein or any other fragment of 
a full-length PRO polypeptide sequence as disclosed herein. Variants do not encompass the native nucleotide 
sequence. 

25 Ordinarily, PRO variant polynucleotides are at least about 30 nucleotides in length, often at least about 

60 nucleotides in length, more often at least about 90 nucleotides in length, more often at least about 120 
nucleotides in length, more often at least about 150 nucleotides in length, more often at least about 180 
nucleotides in length, more often at least about 210 nucleotides in length, more often at least about 240 
nucleotides in length, more often at least about 270 nucleotides in length, more often at least about 300 
30 nucleotides in length, more often at least about 450 nucleotides in length, more often at least about 600 
nucleotides in length, more often at least about 900 nucleotides in length, or more. 

"Percent (%) nucleic acid sequence identity- with respect to PRO-encoding nucleic acid sequences 
identified herein is defined as the percentage of nucleotides in a candidate sequence that are identical with the 
nucleotides in the PRO nucleic acid sequence of interest, after aligning the sequences and introducing gaps, if 
35 necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining percent 
nucleic acid sequence identity can be achieved in various ways that are within the skill in the art, for instance, 
using publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) 



307 



WO 99/63088 



PCT/US99/12252 



software. For purposes herein, however, % nucleic acid sequence identity values are generated using the WU- 
BLAST-2 computer program (Altschul et al. , Methods in Enzvmologv 266:460-480 (1996)). Most of the WU- 
BLAST-2 search parameters are set to the default values. Those not set to default values, i.e., the adjustable 
parameters, are set with the following values: overlap span = 1 , overlap fraction = 0.125, word threshold (T) 
= 11, and scoring matrix = BLOSUM62. For purposes herein, a % nucleic acid sequence identity value is 
5 determined by dividing (a) the number of matching identical nucleotides between the nucleic acid sequence of 
the PRO polypeptide-encoding nucleic acid molecule of interest having a sequence derived from the native 
sequence PRO polypeptide -encoding nulceic acid and the comparison nucleic acid moelcule of interest (i.e., the 
sequence against which the PRO polypeptide-encoding nucleic acid molecule of interest is being compared which 
may be a variant PRO polynucleotide) as determined by WU-BLAST-2 by (b) the total number of nucleotides 
10 of the PRO polypeptide-encoding nucleic acid molecule of interest. 

In other embodiments, PRO variant polynucleotides are nucleic acid molecules that encode an active 
PRO polypeptide and which are capable of hybridizing, preferably under stringent hybridization and wash 
conditions, to nucleotide sequences encoding a full-length PRO polypeptide as disclosed herein. PRO variant 
poi, peptides may be those that are encoded by a PRO variant polynucleotide. 
15 The term "positives", in the context of sequence comparison performed as described above, includes 

residues in the sequences compared that are not identical but have similar properties (e.g. as a result of 
conservative substitutions, see Tabic 1 below). For purposes herein, the % value of positives is determined by 
dividing (a) the number of amino acid residues scoring a positive value between the PRO polypeptide amino acid 
sequence of interest having a sequence derived from the native PRO polypeptide sequence and the comparison 
20 amino acid sequence of interest (i.e., the amino acid sequence against which the PRO polypeptide sequence is 
being compared) as determined in the BLOSUM62 matrix of WU-BLAST-2 by (b) the total number of amino 
acid residues of the PRO polypeptide of interest. 

"Isolated, " when used to describe the various polypeptides disclosed herein, means polypeptide that has 
been identified and separated and/or recovered from a component of its natural environment. Contaminant 
25 components of its natural environment are materials that would typically interfere with diagnostic or therapeutic 
uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or non-proteinaceous 
solutes. In preferred embodiments, the polypeptide will be purified (1) to a degree sufficient to obtain at least 
15 residues of N-terminal or internal amino acid sequence by use of a spinning cup sequenator. or (2) to 
homogeneity by SDS-PAGE under non-reducing or reducing conditions using Coomassie blue or, preferably, 
silver stain. Isolated polypeptide includes polypeptide in situ within recombinant cells, since at least one 
component of the PRO polypeptide natural environment will not be present. Ordinarily, however, isolated 
polypeptide will be prepared by at least one purification step. 

An "isolated" PRO polypeptide-encoding nucleic acid is a nucleic acid molecule that is identified and 
separated from at least one contaminant nucleic acid molecule with which it is ordinarily associated in the natural 
source of the PRO polypeptide nucleic acid. An isolated PRO polypeptide nucleic acid molecule is other than 
in the form or setting in which it is found in nature. Isolated PRO polypeptide nucleic acid molecules therefore 
are distinguished from the specific PRO polypeptide nucleic acid molecule as it exists in natural cells. However, 
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an isolated PRO polypeptide nucleic acid molecule includes PRO polypeptide nucleic acid molecules contained 
in cells that ordinarily express the PRO polypeptide where, for example, the nucleic acid molecule is in a 
chromosomal location differeni from that of natural cells. 

The term "control sequences" refers to DNA sequences necessary for the expression of an operably 
linked coding sequence in a particular host organism. The control sequences that arc suitable for prokaryotes, 
5 for example , include a promoter, optionally an operator sequence, and a ribosome binding site . Eukaryotic cells 
are known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic 
acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a 
polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or 
10 enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome 
binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally, 
"operably linked" means that the DNA sequences being linked are contiguous, and, in the case of a secretory 
leader, contiguous and in reading phase. However, enhancers do not have to be contiguous. Linking is 
L-romplished by ligation at convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide 
15 adaptors or linkers are used in accordance with conventional practice. 

The term "antibody" is used in the broadest sense and specifically covers, for example, single anti-PRO 
monoclonal antibodies (including agonist, antagonist, and neutralizing antibodies), anti-PRO antibody 
compositions with polyepitopic specificity, single chain anti-PRO antibodies, and fragments of anti-PRO 
antibodies (see below). The term "monoclonal antibody" as used herein refers to an antibody obtained from a 
20 population of substantially homogeneous antibodies, i.e. , the individual antibodies comprising the population are 
identical except for possible naturally-occurring mutations that may be present in minor amounts. 

"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art, and 
generally is an empirical calculation dependent upon probe length, washing temperature, and salt concentration. 
In general, longer probes require higher temperatures for proper annealing, while shorter probes need lower 
25 temperatures. Hybridization generally depends on the ability of denatured DNA to reanneal when 
complementary strands are present in an environment below their melting temperature. The higher the degree 
of desired homology between the probe and hybridizable sequence, the higher the relative temperature which 
can be used. As a result, it follows that higher relative temperatures would tend to make the reaction conditions 
more stringent, while lower temperatures less so. For additional details and explanation of stringency of 
30 hybridization reactions, see Ausubel et al., Current Protocols in Molecular Biology Wiley Interscience 
Publishers, (1995). 

"Stringent conditions" or "high stringency conditions", as defined herein, may be identified by those 
that: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium 
chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during hybridization a 
35 denaturing agent, such as formamide, for example, 50% (v/v) fonnamide with 0.1% bovine serum 
alburnuVO. 1 % FicoU/0. 1 % polyv iny Ipyrrolidone/SOmMsodium phosphate buffer at pH 6.5 with 750 mM sodium 
chloride, 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M 
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sodium citrate). 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardfs solution, 
sonicated salmon sperm DNA (50 jig/ml), 0.1 % SDS, and 10% dextran sulfate at 42 °C, with washes at 42°C 
in 0.2 x SSC (sodium chloride/sodium citrate) and 50% formamide at 55*0, followed by a high- stringency wash 
consisting of 0.1 x SSC containing EDTA at 55 °C. 

"Moderately stringent conditions" may be identified as described by Sambrook et al., Molecular 
Cloning: A Laboratory Manual. New York: Cold Spring Harbor Press, 1989, and include the use of washing 
solution and hybridization conditions (e.g., temperature, ionic strength and %SDS) less stringent that those 
described above. An example of moderately stringent conditions is overnight incubation ai 37°C in a solution 
comprising: 20% formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 
7.6), 5 x Denhardfs solution, 10% dextran sulfate, and 20 mg/ml denatured sheared salmon sperm DNA, 
followed by washing the filters in 1 x SSC at about 37-50 c C. The skilled artisan will recognize how to adjust 
the temperature, ionic strength, etc. as necessary to accommodate factors such as probe length and the like. 

The term "epitope tagged" when used herein refers to a chimeric polypeptide comprising a PRO 
polypeptide fused to a "tag polypeptide". The tag polypeptide has enough residues to provide an epitope against 
v* fuch an antibody can be made, yet is short enough such that it does not interfere with activity of the polypeptide 
to which it is fused. The tag polypeptide preferably also is fairly unique -so that the antibody does not 
substantially cross-react with other epitopes. Suitable tag polypeptides generally have at least six amino acid 
residues and usually between about 8 and 50 amino acid residues (preferably, between about 10 and 20 amino 
acid residues). 

As used herein, the term "immunoadhesin" designates antibody-like molecules which combine the 
binding specificity of a heterologous protein (an "adhesin") with the effector functions of immunoglobulm 
constant domains. Structurally, the immunoadhesins comprise a fusion of an amino acid sequence with the 
desired binding specificity which is other than the antigen recognition and binding site of an antibody (i.e., is 
"heterologous"), and an immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin 
molecule typically is a contiguous amino acid sequence comprising at least the binding site of a receptor or a 
ligand. The immunoglobulin constant domain sequence in the immunoadhesin may be obtained from any 
immunoglobulin, such as IgG-1, lgG-2, IgG-3, or IgG-4 subtypes, IgA (including IgA-l and IgA-2), IgE, IgD 
or IgM. 

"Active" or "activity" for the purposes herein refers to form(s) of a PRO polypeptide which retain a 
biological and/or an immunological activity of native or naturally-occurring PRO, wherein "biological" activity 
refers to a biological function (either inhibitory or stimulatory) caused by a native or narurally-ctcurring PRO 
other than the ability to induce the production of an antibody against an antigenic epitope possessed by a native 
or naturally -occurring PRO and an "immunological" activity refers to the ability to induce the production of an 
antibody against an antigenic epitope possessed by a native or naturally-occurring PRO. 

The term "antagonist" is used in the broadest sense, and includes any molecule that partially or fully 
blocks, inhibits, or neutralizes a biological activity of a native PRO polypeptide disclosed herein. In a similar 
manner, the term "agonist" is used in the broadest sense and includes any molecule that mimics a biological 
activity of a native PRO polypeptide disclosed herein. Suitable agonist or antagonist molecules specifically 
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include agonist or antagonist antibodies or antibody fragments, fragments or amino acid sequence variants of 
native PRO polypeptides, peptides, small organic molecules, etc. Methods for identifying agonists or 
antagonists of a PRO polypeptide may comprise contacting a PRO polypeptide with a candidate agonist or 
antagonist molecule and measuring a detectable change in one or more biological activities normally associated 
with the PRO polypeptide. 

"Treatment" refers to both therapeutic treatment and prophylactic or preventative measures, wherein 
the object is to prevent or slow down (lessen) the targeted pathologic condition or disorder. Those in need of 
treatment include those already with the disorder as well as those prone to have the disorder or those in whom 
the disorder is to be prevented. 

"Chronic" administration refers to adniinistration of the agent(s) in a continuous mode as opposed to 
an acute mode, so as to maintain the initial therapeutic effect (activity) for an extended period of time. 
"Intermittent" administration is treatment that is not consecutively done without interruption, but rather is cyclic 
in nature. 

"Mammal" for purposes of treatment refers to any animal classified as a mammal, including humans, 
aomesiic and farm animals, and zoo, sports, or pet animals, such as dogs, cats, cattle, horses, sheep, pigs, goats, 
rabbits, etc. Preferably, the mammal is human. 

Administration "in combination with" one or more further therapeutic agents includes simultaneous 
(concurrent) and consecutive administration in any order. 

"Carriers" as used herein include pharmaceutical^ acceptable carriers, excipiems, or stabilizers which 
are nontoxic to the cell or mammal being exposed thereto at the dosages and concentrations employed. Often 
the physiologically acceptable carrier is an aqueous pH buffered solution. Examples of physiologically 
acceptable carriers include buffers such as phosphate, citrate, and other organic acids; antioxidants including 
ascorbic acid; low molecular weight (less than about 10 residues) polypeptide; proteins, such as serum albumin, 
gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, 
glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides, and other carbohydrates including 
glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or sorbitol; salt- 
forming counterions such as sodium; and/or nonionic surfactants such as TWEEN™, polyethylene glycol (PEG), 
and PLURONICS™. 

"Antibody fragments" comprise a portion of an intact antibody, preferably the antigen binding or 
variable region of the intact antibody. Examples of antibody fragments include Fab, Fab', F(ab')3, and Fv 
fragments; diabodies; linear antibodies (Zapata et al., Protein Eng. 8(10): 1057-1062 [1995]); single-chain 
antibody molecules; and multispecific antibodies formed from antibody fragments. 

Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab" 
fragments, each with a single antigen-binding site, and a residual "Fc" fragment, a designation reflecting the 
ability to crystallize readily. Pepsin treatment yields an F(ab*) ; fragment that has two antigen-combining sites 
and is still capable of cross-linking antigen. 
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"Fv" is the minimum antibody fragment which contains a complete antigen- recognition and -binding 
site. This region consists of a dimer of one heavy- and one light-chain variable domain in tight, non-covalent 
association. It is in this configuration that the three CDRs of each variable domain interact to define an antigen- 
binding site on the surface of the VH-VL dimer. Collectively, the six CDRs confer antigen-binding specificity 
to the antibody. However, even a single variable domain (or half of an Fv comprising only three CDRs specific 
for an antigen) has the ability to recognize and bind antigen, although at a lower affinity than the entire binding 
site. 

The Fab fragment also contains the constant domain of the light chain and the first constant domain 
(CHI) of the heavy chain. Fab fragments differ from Fab' fragments by the addition of a few residues at the 
carboxy terminus of the heavy chain CH 1 domain including one or more cysteines from the antibody hinge 
region. Fab'-SH is the designation herein for Fab' in which the cysteine residue(s) of the constant domains bear 
a free thiol group. F(ab')2 antibody fragments originally were produced as pairs of Fab* fragments which have 
hinge cysteines between them. Other chemical couplings of antibody fragments are also known. 

The "light chains" of antibodies (immunoglobulins) from any vertebrate species can be assigned to one 
of ivvo clearly distinct types, called kappa and lambda, based on the amino acid sequences of their constant 
domains. 

Depending on the amino acid sequence of the constant domain of their heavy chains, immunoglobulins 
can be assigned to different classes. There are five major classes of immunoglobulins: IgA, IgD, IgE, IgG, and 
IgM, and several of these may be further divided into subclasses (isorypes), e.g., IgGl, IgG2, IgG3, IgG4, IgA, 
and IgA2. 

"Single-chain Fv" or "sFv" antibody fragments comprise the VH and VL domains of antibody, wherein 
these domains are present in a single polypeptide chain. Preferably, the Fv polypeptide further comprises a 
polypeptide linker between the VH and VL domains which enables the sFv to form the desired structure for 
antigen binding. For a review of sFv. see Pluckthun in The Pharmacology of Monoclonal Antibodies, vol. 113, 
Rosenburg and Moore eds., Springer- Verlag, New York, pp. 269-315 (1994). 

The term "diabodies" refers to small antibody fragments with two antigen-binding sites, which 
fragments comprise a heavy-chain variable domain (VH) connected to a light-chain variable domain (VL) in the 
same polypeptide chain (VH - VL). By using a linker that is too short to allow pairing between the two domains 
on the same chain, the domains are forced to pair with the complementary domains of another chain and create 
two antigen-binding sites. Diabodies are described more fully in, for example, EP 404,097; WO 93/1 1 161; and 
Hollinger et aL, Proc. Natl. A cad. Sci. USA 90:6444-6448 (1993). 

An "isolated" antibody is one which has been identified and separated and/or recovered from a 
component of its natural environment. Contaminant components of its natural environment are materials which 
would interfere with diagnostic or therapeutic uses for the antibody, and may include enzymes, hormones, and 
other protcinaceous or nonproteinaceous solutes. In preferred embodiments, the antibody will be purified (1) 
to greater than 95 % by weight of antibody as determined by the Lowry method, and most preferably more than 
99% by weight, (2) to a degree sufficient to obtain at least 15 residues of N-termina! or internal amino acid 
sequence by use of a spinning cup scqucnator, or (3) to homogeneity by SDS-PAGE under reducing or 
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nonrcducing conditions using Coomassie blue or, preferably, silver stain. Isolated antibody includes the antibody 
in situ within recombinant cells since at least one component of the antibody's natural environment will not be 
present. Ordinarily, however, isolated antibody will be prepared by at least one purification step. 

The word "label" when used herein refers to a detectable compound or composition which is conjugated 
directly or indirectly to the antibody so as to generate a 'labeled' antibody. The label may be detectable by itself 
5 (e.g. radioisotope labels or fluorescent labels) or, in the case of an enzymatic label, may catalyze chemical 
alteration of a substrate compound or composition which is detectable. 

By "solid phase" is meant a non-aqueous matrix to which the antibody of the present invention can 
adhere. Examples of solid phases encompassed herein include those formed partially or entirely of glass (e.g. , 
controlled pore glass), polysaccharides (e.g., agarose), polyacrylamides. polystyrene, polyvinyl alcohol and 
10 silicones. In certain embodiments, depending on the context, the solid phase can comprise the well of an assay 
plate; in others it is a purification column (e.g., an affinity chromatography column). This term also includes 
a discontinuous solid phase of discrete particles, such as those described in U.S. Patent No. 4,275,149. 

A "liposome" is a small vesicle composed of various types of lipids, phospholipids and/oT surfactant 
■vHJ; if usefdl for delivery of a drug (such as a PRO polypeptide or antibody thereto) to a mammal. The 
15 components of the liposome are commonly arranged in a bilayer formation, similar to the lipid arrangement of 
biological membranes. 

A "small molecule" is defined herein to have a molecular weight below about 500 Daltons. 



II Compositio ns and Methods of the Invention 

20 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PRO polypeptides. In particular, cDNAs encoding various PRO 
polypepudes have been identified and isolated, as disclosed in further detail in the Examples below. It is noted 
that proteins produced in separate expression rounds may be given different PRO numbers but the UNQ number 
is unique for any given DNA and the encoded protein, and will not be changed. However, for sake of 
25 simplicity, in the present specification the protein encoded by the full length native nucleic acid molecules 
disclosed herein as well as all further native homologues and variants included in the foregoing definition of 
PRO, will be referred to as "PRO/number", regardless of their origin or mode of preparation. 

As disclosed in the Examples below, various cDNA clones have been deposited with the ATCC. The 
actual nucleotide sequences of those clones can readily be determined by the skilled artisan by sequencing of the 
30 deposited clone using routine methods in the an. The predicted amino acid sequence can be determined from 
the nucleotide sequence using routine skill. For the PRO polypeptides and encoding nucleic acids described 
herein, Applicants have identified what is believed to be the reading frame best identifiable with the sequence 
information available at the time. 

35 1- Full-length P RQ281 Pr>lvp pptiH»c 

The present invention provides newly identifiedand isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR028I (UNQ244). In particular. cDNA encoding a PRQ281 
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polypeptide has been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST-2 sequence al.gnmen. computer program, it has been found that a full-length 
native sequence PR0281 (shown in Figure 2 and SEQ ID N0:2) has certain amino acid seance idenuty w.th 
the rat TEGT protein. Accordingly, i. is presently believed tha, PR028 . disclosed in the present appl.cat.on .s 
a newly identified TEGT homolog and may possess activiry typical of that protein. 

2. Full-length P WHT76 Polypeptides 
The present invention provides newly identified and isolated nucleotide sequencesencodingpolypeptides 
referred to in the present application as PR0276 (UNQ243). .n particular, cDNA encoding a PR0276 
polypeptide has been identified and isolated, as disclosed in further detail in the Examples below. 

As far as is known, the DNA16435-1208 sequence encodes a novel factor designated herein as PR0276; 
using WU-BLAST-2 sequence alignment computer programs, no significant sequence identities to any known 
proteins were revealed. The sequence identity identifications which were found are listed below in the examples. 

3. Full-length PBQ189 Polypeptides 

The present invention provides newly idennfiedandisolatednucleotide sequences encoding polypeptides 
referred to in the present application as PR0189. In particular. Applicants have identified and isolated cDNA 
encoding a PR0189 polypepude. as disclosed in further detail in the Examples below. To Applicants present 
knowledge, the DNA21624-1391 nucleotide sequence encodes a novel factor; using BLAST and FastA sequence 
alignment computer programs, no significant sequence identities to any known proteins were revealed. 

4. Full-length PKO190 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequencesencodingpolypepudes 
referred to in the present application as PRO190. In particular, Applicants have identified and isolated cDNA 
encoding a PRO190 polypeptide, as disclosed in further detail in the Examples below. The PR019T>encodmg 
clone was isolated from a human retina library. To Applicants present knowledge, the DNA23334-1392 
nucleotide sequence encodes a novel multiple transmembrane spanning protein; using BLAST and FastA 
sequence alignment computer programs, .here is some sequence identity with CMP-sialicacid and UDP-galactose 
transporters, indicating that PRO190 may be related to transporter or mat PRO190 may be a novel transporter. 



S. Full-length PRQ341 Polypeptides 

The present invention provides newly identifiedand isolatednucleotide sequences encoding polypeptides 
referred to in the present application as PR0341 (UNQ300). In particular, cDNA encoding a PR0341 
polypeptide has been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA26288-1239 clone was isolated from a human placenta library. As far as is known, the 
DNA26288-1239 sequence encodes a novel factor designated herein as PR0341; using the WU-BLAST-2 
sequence alignment computer program, no significant sequence identities to any known proteins were revealed. 
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6. Full-length PRO180 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 180 (UNQ154). In particular, cDNA encoding a PRO 180 
polypeptide has been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA26843-1389 clone was isolated from a human placenta library using oligos formed from 
DNA12922 isolated from an amylase screen. As far as is known, the DNA26843- 1389 sequence encodes a novel 
factor designated herein as PRO 180. 

7. Full-length PROI94 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0194. In particular, Applicants have identified and isolated cDNA 
encoding a PRO 194 polypeptide, as disclosed in further detail in the Examples below. The PR0194-encoding 
clone was isolated from a human fetal lung library. To Applicants present knowledge, the DNA26844-I394 
nucleotide sequence encodes a novel factor; using BLAST and FastA sequence alignment computer programs, 
no significant sequence identities to any known proteins were revealed. 

8. Full-length PRO203 Polypeptides 

The present invention provides newly identifiedand isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO203. In particular, Applicants have identified and isolated cDNA 
encoding a PRO203 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs, Applicants found that the PRO203 polypeptide has sequence identity 
with GST ATPase. Accordingly, it is presently believed that PRO203 polypeptide disclosed in the present 
application is a newly identified member of the ATPase family and possesses activity typical of the GST ATPase. 

9. Full-length PRO290 Polvrentide* 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0290. In particular, cDNA encoding a PRO290 polypeptide has been 
identified and isolated, as disclosed in further detail in the Examples below. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 23 (SEQ ID NO: 33), revealed sequence identities 
between the PRO290 amino acid sequence and the following Dayhoff sequences: P R99800, CC4H_HUMAN, 
YCS2_ YEAST, CEF35G12_13. HSFAN l, MMU52461 J, MMU70015J, HSU67615J, CET01H10J and 
CELT28F2_6. 

It is currently believed that PRO290 is an intracellular protein related to one or more of the above 

proteins. 
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10. Full-length PRQ874 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0874. In particular. Applicants have identified and isolated cDNA 
encoding a PR0874 polypeptide, as disclosed in further detail in the Examples below. The PR0874-encoding 
clone was isolated from a human fetal lung library. To Applicants present knowledge, the DNA4062 1-1440 
nucleotide sequence encodes a novel factor. Although, using BLAST and FastA sequence alignment computer 
programs, some sequence identity with known proteins was revealed. 

11. Full-length PRO710 Polypeptides 

The present invention provides newly identified and isolated nucleotidesequencesencoding polypeptides 
referred to in the present application as PRO710. In particular, Applicants have identified and isolated cDNA 
encoding a PRO710 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs, Applicants found that the PR07 10 polypeptide has significant similarity 
to the CDC45 protein. Accordingly, it is presently believed that PRO710 polypeptide disclosed in the present 
application is a newly identified CDC45 homolog. 

12. Full-length PRQ1151 Polypeptides 

The present invention provides newly identifiedand isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 15 1 . In particular, cDNA encoding a PROl 151 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST-2 sequence alignment computer program, it has been found that a full-length 
native sequence PROl 151 (shown in Figure 30 and SEQ ID NO:47) has certain amino acid sequence identity 
with the human 30 kD adipocyte complement-related precursor protein (ACR3 HUMAN). Accordingly, it is 
presently believed that PROl 151 disclosed in the present application is a newly identified member of the 
complement protein family and may possess activity typical of that family. 

13. Full-length PRQ1282 Polype ptide 

The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR01282. In particular, cDNA encoding a PRO 1282 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

As far as is known, the DNA45495-1550 sequence encodes a novel factor designated herein as 
PROl 282. Using WU-BLAST-2 sequence alignment computer programs, some sequence identities between 
PR01282 and other leucine rich repeat proteins were revealed, as discussed in the examples below, indicating 
that a novel member of the leucine rich repeat superfamily has been identified. 

14. Full-length PR Q358 Polyp eptide 

The present invention further provides newly identified and isolated nucleotide sequences encoding a 
polypeptide referred to in the present application as PR0358. In particular, Applicants have identified and 



316 



WO 99/63088 



PCT/US99/12252 



isolated cDNA encoding a novel human Toll polypeptide (PR0358), as disclosed in further detail in the 
Examples below. Using BLAST and FastA sequence alignment computer programs, Applicants found that the 
coding sequence of PR0358 shows significant homology to DNA sequences HSU8854CM, HSU88878_1, 
HSU88879 _1, HSU88880J, HS88881J, and HSU79260_1 in the GenBank database. With the exception 
of HSU79260_1, the noted proteins have been identified as human toll-like receptors. 

Accordingly, it is presently believed that the PR0358 proteins disclosed in the present application are 
newly identified hu m a n homologues of the Drosophila protein Toll, and are likely to play an important role in 
adaptive immunity. More specifically, PR0358 may be involved in inflammation, septic shock, and response 
to pathogens, and play possible roles in diverse medical conditions that are aggravated by immune response, such 
as, for example, diabetes, ALS, cancer, rheumatoid arthritis, and ulcers. The role of PR0385 as pathogen 
pattern recognition receptors, sensing the presence of conserved molecular structures present on microbes, is 
further supported by the data disclosed in the present application, showing that a known human Toll-like 
receptor, TLR2 is a direct mediator of LPS signaling. 

15. Full-length PRO1310 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1310. In particular, cDNA encoding a PRO1310 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST-2 sequence alignment computer programs, it has been found that a full-length native 
sequence PRO1310 (shown in Figure 36 and SEQ ID NO:62) has certain amino acid sequence identity with 
carboxypeptidase X2. Accordingly, it is presently believed that PRO1310 disclosed in the present application 
is a newly identified member of the carboxypeptidase family and may possess carboxyl end amino acid removal 
activity. 

16. Full-length PRQ698 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0698. In particular. Applicants have identified and isolated cDNA 
encoding a PR0698 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs, Applicants found that the PR0698 polypeptide has significant similarity 
to the olfactomedin protein. Accordingly, it is presently believed that PR0698 polypeptide disclosed in the 
present application may be a newly identified olfactomedin homolog. 

17. Full-length PRQ732 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0732. In particular, Applicants have identified and isolated cDNA 
encoding a PR0732 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs, Applicants found that the PR0732 polypeptide has significant similarity 
to the human placental Diff33 protein. Accordingly, it is presently believed that PR0732 polypeptide disclosed 
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in the present application is a newly identified Diff33 homolog. 

18- FulMeneth PRO1I20 Polypep tide 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1120. In particular, cDNA encoding a PROl 120 polypeptide has 
5 been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST-2 sequence alignment computer programs, it has been found that a full-length native 
sequence PROl 120 (shown in Figure 47 and SEQ ID NO:84) has certain amino acid sequence identity with the 
known sulfatase proteins designated CELK09C4J , and GL6S_HUMAN, respectively, in the Dayhoff database 
(version 35.45 SwissProt 35). Accordingly, it is presently believed that PROl 120 disclosed in the present 
10 application is a newly identified member of the sulfatase family and may possess activity typical of sulfatases. 

19. FulMeneth PR Q537 Polypeptide 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0537. In particular, cDNA encoding a PR0537 polypeptide has been 

15 identified and isolated, as disclosed in further detail in the Examples below. The DNA49141-1431 clone was 
isolated from a human placenta library using a trapping technique which selects for nucleotide sequences 
encoding secreted proteins. Thus, the DNA49141-1431 clone does encode a secreted factor. As far as is 
known, the DNA49141-1431 sequence encodes a novel factor designated herein as PR0537; using the WU- 
BLAST2 sequence alignment computer program, no significant sequence identities to any known proteins were 

20 revealed. 

20 - Full-length PRQ536 Polyp eptide 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0536. In particular, cDNA encoding a PR0536 polypeptide has been 
25 identified and isolated, as disclosed in further detail in the Examples below. 

The DNA49142-1430 clone was isolated from a human infant brain library using a trapping technique 
which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA49142-1430 clone does 
encode a secreted factor. As far as is known, the DNA4914M430 sequence encodes a novel factor designated 
herein as PR0536; using the WU-BLAST-2 sequence alignment computer program, no significant sequence 
30 identities to any known proteins were revealed. 

21- Full-length PRQ535 Polyp eptide 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0535. In particular, cDNA encoding a PR0535 polypeptide has been 
identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PRQ535 (shown in Figure 53 and SEQ ID NO:99) has amino acid sequence identity with a 
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putative peptidyl-prolyl isomerasc protein. Accordingly, it is presently believed that PR0535 disclosed in the 
present application is a newly identified member of the isomerase protein family and may possess activity typicaJ 
of those proteins. 

22. Full-length PRQ718 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PR0718. In particular, Applicants have identified and isolated cDNA 
t 

encoding a PR0718 polypeptide, as disclosed in further detail in the Examples below. The PR0718-encoding 
clone was isolated from a human fetal lung library. To Applicants present knowledge, the DNA49647-1398 
nucleotide sequence encodes a novel factor; using BLAST and FastA sequence alignment computer programs, 
no significant sequence identities to any known proteins were revealed. 

23. Full-length PRQ872 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0872. In particular, Applicants have identified and isolated cDNA 
encoding a PR0872 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs, Applicants found that the PR0872 polypeptide has sequence identity 
with dehydrogenases. Accordingly, it is presently believed that PR0872 polypeptide disclosed in the present 
application is a newly identified member of the dehydrogenase family and possesses dehydrogenase activity. 

24. Full-length PRO1063 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1063. In particular. Applicants have identified and isolated cDNA 
encoding a PRO1063 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs, Applicants found that the PRO 1063 polypeptide has significant 
similarity to the human type IV collagenase protein. Accordingly, it is presently believed that PRO1063 
polypeptide disclosed in the present application is a newly identified collagenase homolog. 

25. Full-length PRQ619 Polypep tides 

The present in vcntionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0619. In particular, cDNA encoding a PR06 19 polypeptide has been 
identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST-2 sequence alignment computer program, it has been found that a full-length native 
sequence PR0619 (shown in Figure 68 and SEQ ID NO: 1 17) has certain amino acid sequence identity with 
VpreB3. Accordingly, it is presently believed that PR0619 disclosed in the present application is a newly 
identified member of the IgG superfamily and may possess activity related to the assembly and/or components 
of the surrogate light chain associated with developing B cells. 
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26 - FuM-lenrth P? Q943 Potyp ^riH^ 
Th C present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0943 . in particular, cDNA encoding a PR0943 polypeptide has been 
identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST-2 sequence alignment computer program, it has been found that a full-length 
5 native sequence PR0943 (shown in Figure 70 and SEQ ID NO: 1 19) has amino acid sequence identity with the 
fibroblast growth factor receptor-4 protein. Accordingly, it is presently believed that PR0943 disclosed in the 
present application is a newly identified member of the flbrobaist growth factor receptor family and may possess 
activity typicaJ of that family. 

10 27 - Full-length PB Q1188 Pnly^ ptiri^ 

™ e Pr««>™eM,onpro,ddes newly identffi^ 
referred to in the present application as PROl 188. In particular. cDNA encoding a PROl 188 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

As discussed in more detail in Example 1 below, using WU-BLAST-2 sequence altgnmem computer 
programs, i, has been found that a full-length native sequence PROl 188 (shown in Figure 72; SEQ ID NO:124) 
has cenain amino acid sequence identity with nucleotide pyrophosphohydro.ase (SSU831 14 1). Accordingly 
« is presently believed that PR01188 disclosed in the present apphcation is a newly identified member of the 
nucleoude pyrophosphohydrolase family and may possess activ.ry typical of that family of proteins. 

20 2S - Full-length PB QH33 Polyp oid,. 

The present invention provides newly identifiedand isolated nucleotide sequences encoding polypeptides 
referred to tn the present application as PR01133. In particular, cDNA encoding , PROl .33 polypeptide has 
been .dentified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST-2 sequence alignment computer programs, it has been found that a full-length native 
25 sequence PR01133 (shown in Figure 74 and SEQ ID NO:,29, has certain amino acid sequence identity with 
net™ ,a. Dayhoff accession AF002717J. Accordingly, i, is present.y beiieved Ola, PROH33 disclosed in the 
present apphcation shares at least one related mechanism with netrin. 

29 - Full-length P RQ784 P n ) Yr .>p»i,w 

rcfcnedtotnthepresentapplica^ 

whtch encodes a PR0784 polypeptide, has been identified and iso.ated. as dtsc.osed in further detail in the 

Examples below. 

Using BLAST and FastA sequence ahgnmem computer programs, i, has been found that a full-length 
35 nattve sequence PR0784 (shown in Figure 76 and SEQ ID NO : ,35) has cenam amino acid sequence tdentity 
wrth sec22 homologs. Accordingly, it is presently believed that PR0784 disclosed in the present applicaUon is 
a new,y identified member of the sec22 family and may possess vesicle trafficking activities typical of the sec22 
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family. 



10 
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30 * FulMcnyth P RQ783 Pnlvp ^pfiH^ 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0783. In particular, Applicants have identified and isolated cDNA 
encoding a PR0783 polypeptide, as disclosed in mrther detail in the Examples below. The PR0783-encoding 
clone was Mated from a human fcul kidney library. To Applicants present knowledge, the DNA53996-1442 

nucleotide sequence encodes a novel factor However mine ruct^c 

m-ior. nowever, using BLAST and FastA sequence alignment computer 

programs, some sequence identity to known proteins was found. 
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31 - Full-length P RQ820 Pnlyp ^n^ 
The present invention provides newly identified and isolated nucleotidesequences encoding polypeptides 
referred to m the present appHcation as PRO820. !n particular, App.icants have identified and isolated cDNA 
encoding a PRO820 polypeptide, as disclosed in further detai. in the Samples below. Using BLAST and FastA 
<°n*uter P ro^^ 

15 sequence identity with the low affinity immunog.obu.in gamma Fc receptor, the IgE high affinity Fc receptor 
and the h,gh affinity immunog.obu.in epsi.on receptor. Accordingly, it is presently believed that PRO820 
poiypeptide disclosed in the present application is a „ew!y identified member of the Fc receptor family. 

32 ' Full-lenrth PW Q1080 Pniyp yp.iw., 
Thepresentinventionprovidesnewlyidentifiedand isolated nucleotide sequences encodingpolypepndes 
referred to in the present application as PRO1080. In particular. Applicants have identified and isolated cDNA 
encode a PRO.080 poiypeptide. as disdosed in funher detai. in the Examples below. Using BLAST and 

FastA se q u= n ceali gn ^entcom P u,erpro g r a ms.Da y hoffdaubase ( version35.45SwissF T ot35),A P p U canuf 0 und 
that the PRO.080 polypeptide has sequence identity with a 39.9 k d protein designated as "YRY1 CAEEL" a 
DnaJ homolog designated -AF027,49_5-. a Dn*. homo.og 2 designated "RNU95727 ,-. and" D na3,C P r3 
deseed - AFO, ,793_. -. Accordingly. ^ re^ts indicate that the PRO.080 peptide disclosed in me 
present appHcation may be a newly identified member of the DnaUike protein family and therefore may be 
involved in protein biogenesis. 



30 



33 ' FulMenrth PR Q1Q79 Polyp ^-w^ 
™™™n^^ 

7„ r m aPP ' iCali0n " PRO, ° 79 ^ CDNA «*■ « PR °' 079 has 

been idennfied and isolated, as disclosed in funher detail in the Examples below 

35 PROFIT I 5 ^ ^ DNA56 ° 50 - 1455 «*— «-» • -1 factor des.gnated herein as 
FRO.079. A.though. using WU-BLAST2 sequence alignment computer programs, some sequence identities 

to known proteins was revealed. 
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34. Full-length FRQ793 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0793 . In particular, cDNA encoding a PR0793 polypeptide has been 
identified and isolated, as disclosed in further detail in the Examples below. 

The DNA561 10-1437 clone was isolated from a human skin tumor library. As far as is known, the 
DNA56110-1437 sequence encodes a novel factor designated herein as PR0793; using the WU-BLAST-2 
sequence alignment computer program, no significant sequence identities to any known proteins were revealed. 

35. Full-length PRO1016 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1016. In particular, Applicants have identified and isolated cDNA 
encoding a PRO 10 16 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs, Applicants found that various portions of the PRO1016 
polypeptide have sequence identity with acyltransf erases. Accordingly, it is presently believed that PRO1016 
polypeptide disclosed in the present application is a newly identified member of the acyltransferase family and 
possesses acyltalation capabilities typical of this family. 



36. Full-length PRO1013 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1013. In particular, Applicants have identified cDNA encoding a 
PRO1013 polypeptide, as disclosed in further detail in the Examples below. The PRO1013-encoding clone came 
from a human breast tumor tissue library. Thus, the PRO10l3-encoding clone may encode a secreted factor 
related to cancer. To Applicants present knowledge, the DNA56410-1414 nucleotide sequence encodes a novel 
factor. Using BLAST and FastA sequence alignment computer programs, some sequence identity with 
KIAA0157 and P120 was revealed. PRO1013 has at least one region in common with growth factor and 
cytokine receptors. 



37. Full-length PRQ937 Polyp eptide 

The present invention provides newly identified and isolated nucleotidesequences encoding polypeptides 
referred to in the present application as PR0937. In particular. Applicants have identified and isolated cDNA 
encoding a PR0937 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs, Applicants found that the PR0937 polypeptide has significant sequence 
identity with members of the glypican family of proteins. Accordingly, it is presently believed that PR0937 
polypeptide disclosed in the present application is a newly identified member of the glypican family possesses 
properties typical of the glypican family. 
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38. Full-length PRQ842 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0842. In particular, cDNA encoding a PR0842 polypeptide has been 
identified and isolated, as disclosed in further detail in the Examples below. 

As far as is known, the DNA56855-1447 sequence encodes a novel secreted factor designated herein 
5 as PR0842. However, using WU-BLAST2 sequence alignment computer programs, some sequence identity to 
any known proteins were revealed. 

39. Full-length PRQ839 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
1 0 referred to in the present application as PR0839. In particular, cDN A encoding a PR0839 polypeptide has been 
identified and isolated, as disclosed in further detail in the Examples below. 

As far as is known, the DNA56859-1445 sequence encodes a novel factor designated herein as PR0839. 
However, using WU-BLAST-2 sequence alignment computer programs, some sequence identities to known 
proteins was revealed- 

15 

40. Full-length PRO1180 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 180. In particular. Applicants have identified and isolated cDNA 
encoding a PRO 1 180 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
20 FastA sequence alignment computer programs, Applicants found that the PROl 180 polypeptide has significant 
similarity to methyl transferase enzymes. Accordingly, it is presently believed that PROl 180 polypeptide 
disclosed in the present application is a newly identified member of the methyltransferase family and possesses 
activity typical of that family. 

25 41. Full-length PROl 134 Polypeptides 

The present invention provides newly identified and isolated nucleotidesequencesencoding polypeptides 
referred to in the present application as PROl 134. In particular, cDNA encoding a PROl 134 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA56865-1491 clone was isolated from a human fetal liver spleen library using a trapping 
30 technique which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA56865-1491 clone 
does encode a secreted factor. As far as is known, the DNA56865-1491 sequence encodes a novel factor 
designated herein as PRO 11 34; using the WU-BLAST2 sequence alignment computer program, no significant 
sequence identities to any known proteins were revealed. 

35 42. Full-length PRO830 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO830. In particular, cDNA encoding a PRO830 polypeptide has been 
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identified and isolated, as disclosed in further detail in the Examples below. 

The DNA56866-1342 clone was isolated from a human fetal liver/spleen library using a trapping 
technique which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA56866-1342 clone 
does encode a secreted factor. As far as is known, the DNA56866-I342 sequence encodes a novel factor 
designated herein as PRO830; using the WU-BLAST-2 sequence alignment computer program, no significant 
sequence identities to any known proteins were revealed. 

43. Full-length PRQ1115 Polypeptides 

The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 115. In particular, cDNA encoding a PROl 1 15 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

As far as is known, the DNA56868-1478 sequence encodes a novel transmembrane protein designated 
herein as PROl 115. Although, using WU-BLAST-2 sequence alignment computer programs, some sequence 
identities to known proteins were revealed. 

44. Full-length PRQ1277 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1277. In particular, cDNA encoding a PRO 1277 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST-2 sequence alignment computer programs, it has been found that a full-length native 
sequence PROl 277 (shown in Figure 1 13 and SEQ ID NO: 179) has certain amino acid sequence identity with 
Coch-5B2 protein (designated "AF012252 J" in the Dayhoff database). Accordingly, it is presently believed 
that PR01277 disclosed in the present application is a newly identified member of the Coch-5B2 protein family 
and may possess the same activities and properties as Coch-5B2. 

45. Full-length PRQ1135 Polypeptide 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 135. In particular. Applicants have identified and isolated cDNA 
encoding a PROl 135 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs, Applicants found that the PROl 135 polypeptide has significant 
similarity to the alpha 1 ,2-mannosidase protein. Accordingly, it is presently believed that PROl 135 polypeptide 
disclosed in the present application is a newly identified member of the mannosidase enzyme family and 
possesses activity typical of that family of proteins. 

46. Full-length PROI114 Polyp eptide 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1 1 14 interferon receptor. In particular, cDNA encoding a PROl 1 14 
interferon receptor polypeptide has been identified and isolated, as disclosed in further detail in the Examples 
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below. 

Using the WU-BLAST-2 sequence alignment computer program, it has been found that a full-length 
native sequence PR01114 interferon receptor polypeptide (shown in Figure 117 and SEQ ID NO: 183) has 
sequence identity with the other known interferon receptors. Accordingly, it is presently believed that PROl 1 14 
interferon receptor possesses activity typical of other interferon receptors. 

5 

47. Full-length PRQ828 Polypeptides 
The present invention provides newly identified and isolated nucleotide sequencesencoding polypeptides 
referred to in the present application as PR0828. In particular, Applicants have identified and isolated cDNA 
encoding a PR0828 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
10 sequence alignment computer programs, Applicants found that the PR0828 polypeptide has sequence identity 
with glutathione peroxidases. Accordingly, it is presently believed that PR0828 polypeptide disclosed in the 
present application is a newly identified member of the glutathione peroxidase family and possesses peroxidase 
activity and other properties typical of glutathione peroxidases. 

15 48. Full-length PRO1Q09 Polypep tide 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1009. In particular, cDNA encoding a PRO1009 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST-2 sequence alignment computer programs, it has been found that a full-length native 
20 sequence PRO1009 (shown in Figure 122 and SEQ ID NO:194) has certain amino acid sequence identity with 
long-chain acyl-CoA synthetase homolog designated "F69893". Accordingly, it is presently believed that 
PRO 1 009 disclosed in the present application is a newly identified member of the long-chain acyl-CoA synthetase 
family and may possess activity related to this family. 



25 49. Full-length PRO1007 Polyp eptide 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1007. In particular, Applicants have identified and isolated cDNA 
encoding a PRO1007 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs, Applicants found that various portions of the PRO 1007 

30 polypeptide have sequence identity with MAGPIAP. Accordingly, it is presently believed that PRO1007 
polypeptide disclosed in the present application is a newly identified member of the MAGPIAP family and is 
associated with metastasis and/or cell signaling and/or cell replication. 

50. Full-length PRO1056 Polyp eptide 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1056. In particular, cDNA encoding a PRO1056 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
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Using the WU-BLAST-2 sequence alignment computer program, it has been found that a full-length 
native sequence PRO1056 (shown in Figure 127 and SEQ ID NO: 199) has amino acid sequence identity with 
a chloride channel protein. Accordingly, it is presently believed that PRO1056 disclosed in the present 
application is a newly identified chloride channel protein homolog. 

5 51. Full-length P RQ826 Polyp eptide 

The present invention providesnewly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0826. In particular, cDNA encoding a PR0826 polypeptide has been 
identified and isolated, as disclosed in further detail in the Examples below. 

The DNA57694-1341 clone was isolated from a human fetal heart library using a trapping technique 
10 which selects for nucleotide sequences encoding secreted protetns. Thus, the DNA57694-1341 clone does 
encode a secreted factor. As far as is known, the DNA57694-I341 sequence encodes a novel factor designated 
herein as PR0826; using the WU-BLAST-2 sequence alignment computer program, no secant sequence 
identities to any known proteins were revealed. 

15 S2 - Full-length P RQ819 Pnlv r ptjH« 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0819. In particular. cDNA encoding a PR0819 polypeptide has been 
identified and isolated, as disclosed in further detail in the Examples below. 

The DNA57695-1340 clone was isolated from a human fetal liver spleen library using a trapping 
20 technique which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA57695-1340 clone 
does encode a secreted factor. As far as is known, the DNA57695-1340 sequence encodes a novel factor 
designated herein as PR0819; using the WU-BLAST-2 sequence alignment computer program, no significant 
sequence identities to any known proteins were revealed. 

25 S3 - Full-length PROlOOfi Pnl y p » p <f^ 

The present invention provides newly identified and isolated nucleotidesequencesencoding polypeptides 
referred to in the present application as PRO1006. In particular. Applicants have identified and isolated cDNA 
encodmg a PRO.006 polypeptide, as disclosed in further detail in the Examples below. The PRO1006-encoding 
clone was isolated from a human uterus library. To Applies present knowledge, the DNA57699-1412 

30 nucleotide sequence encodes a novel factor; using BLAST and FastA sequence alignment computer programs, 
some sequence identity with a putative tyrosine protein kinase was revealed. 



35 



SA - Full-length P ROH12Pr,lvp ,p»i.l~ 

The present invention provides newly identifiedandisolatednucleotide sequences encoding polypeptides 
referred to in the present application as PR01U2. In particular. App.icams have identified cDNA encoding a 
PRO,, , 2 po.ypep.ide. as disdosed in funher detail in Example 1 below. To Applicants present knowledge the 
DNA57702-1476 nucleotide sequence encodes a nove, factor, although using BLAST and FastA sequence 
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alignment computer programs some sequence identity with other known proteins was found. 

55. Full-length PROl 074 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1074. In particular, Applicants have identified and isolated cDNA 
encoding a PRO 1074 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs, Applicants found that the PRO1074 polypeptide has sequence 
identity with galactosyltransferase. Accordingly, it is presently believed that PRO 1074 polypeptide disclosed 
in the present application is a newly identified member of the galactosyltransferase family and possesses 
galactosyltransferase activity. 

56. Full-length PRO1005 Polypeptides 

The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1005. In particular, cDNA encoding a PRO1005 polypeptide has 
btcn idenafied and isolated, as disclosed in further detail in the Examples below. 

As far as is known, the DNA57708-1411 sequence encodes a novel factor designated herein as 
PRO 1005. However, using WU-BLAST2 sequence alignment computer programs, some sequence identities with 
known proteins was revealed. 

57. Full-length PRO1073 Polypeptides 

The present inventionprovides newly identified and isolated nucleotide sequences encodingpolypeptides 
referred to in the present application as PRO1073. In particular, cDNA encoding a PRO1073 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

As far as is known, the DNA57710 sequence encodes a novel secreted factor designated herein as 
PRO 1073. However, using WU-BLAST2 sequence alignment computer programs, some sequence identities to 
known proteins were revealed. 

58. Full-length PR Q1152 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR01152. In particular, cDNA encoding a PR01152 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA5771 1-1501 clone was isolated from a human infant brain library. As far as is known, the 
DNA5771 1-1501 sequence encodes a novel factor designated herein as PR01152; using the WU-BLAST-2 
sequence alignment computer program, no significant sequence identities to any known proteins were revealed. 

59. Full-length PRQ1136 Polypep tide 

The present invention provides newly identifiedand isolated nucleoude sequences encoding polypeptides 
referred to in the present application as PROl 136. In particular, cDNA encoding a PROl 136 polypeptide has 
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been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PR01136 (shown in Figure 147 and SEQ ID NO:219) has amino acid sequence identity with 
PDZ domain-containing proteins. Accordingly, it is presently believed that PROl 136 disclosed in the present 
application is a newly identified member of the PDZ domain-containing protein family and may possess activity 
typical of that family. 

60. Full-length PRQ813 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0813. In particular, Applicants have identified and isolated cDNA 
encoding a PR0813 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs, Applicants found that the PR0813 polypeptide has significant similarity 
to the pulmonary surfactant-associated protein C. Accordingly, it is presently believed that PR0813 polypeptide 
disclosed in the present application is a newly identified pulmonary surfactant-associated protein C homolog. 

61. Full-length PRO809 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO809. In particular. Applicants have identified and isolated cDNA 
encoding a PRO809 polypeptide, as disclosed in further detail in the Examples below. To Applicants present 
knowledge, the DNA57836-1338 nucleotide sequence encodes a novel factor. 

62. Full-length PRQ791 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0791. In particular, Applicants have identified and isolated cDNA 
encoding a PR0791 polypeptide, as disclosed in further detail in the Examples below. To Applicants present 
knowledge, the DNA57838-1337 nucleotide sequence encodes a novel factor; however, using BLAST and FastA 
sequence alignment computer programs, there does appear to be some sequence identity with MHC-1 antigens, 
indicating that PR0791 may be related thereto in structure and function. 

63. Full-length PRO1004 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1004. In particular, cDNA encoding a PRO1004 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

As far as is known, the DNA57844-1410 sequence encodes a novel factor designated herein as 
PRO 1004. However, using WU-BLAST2 sequence alignment computer programs, some sequence identities with 
known proteins were revealed. 
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Full-length PR011I1 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 1 11 . In particular, cDNA encoding a PROll 11 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
5 sequence PROl 1 1 1 (shown in Figure 157 and SEQ ID NO:229) has certain amino acid sequence identity with 
LIG. Accordingly, it is presently believed that PROl 111 disclosed in the present application is a newly 
identified member of this glycoprotein family. 

65. Full-length PR01344 Polypeptides 
10 The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PR01344. In particular, cDNA encoding a PR01344 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PR01344 (shown in Figure 159 and SEQ ID NO;231) has certain amino acid sequence identity 
15 with the factor C protein of Carcinoscorpius rotundicauda. Accordingly, it is presently believed that PROI344 
disclosed in the present application is a newly identified factor C protein and may possess activity typical of that 
protein. 



20 



66 - Full-length PROl 109 Polyp eptide 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 109. In particular, cDNA encoding a PROl 109 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PROl 109 (shown in Figure 161 and SEQ ID NO:236) has certain amino acid sequence identity 
25 with the human UDP-Gal:GlcNAc galactosyl transferase protein. Accordingly, it is presently believed that 
PROl 109 disclosed in the present application is a newly identified P-galactosyltransferase enzyme and has 
activity typical of those enzymes. 
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67 - Full-length P RQ1383 Polyp eptide 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROI383. In particular, cDNA encoding a PR01383 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
naive sequence PR01383 (shown in Figure 163 and SEQ ID N0:241) has certain amino acid sequence identity 
with the putative human transmembrane protein rnnb precursor (NMB_HUMAN). Accordingly, it is presently 
betieved that PR01383 disclosed in the present application is a newly identified nmb homolog. 
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68. Full-length PRO1003 Polypeptides 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1003. In particular. Applicants have identified and isolated cDNA 
encoding a PRO 1003 polypeptide, as disclosed in further detail in the Examples below. The PROl003-encoding 
clone was isolated from a human breast tumor tissue library. The PROl003-encoding clone was isolated using 
a trapping technique which selects for nucleotide sequences encoding secreted proteins. Thus, the PRO1003- 
encoding clone may encode a secreted factor. To Applicants present knowledge, the UNQ487 (DNA58846- 
1409) nucleotide sequence encodes a novel factor; using BLAST and FastA sequence alignment computer 
programs, no sequence identities to any known proteins were revealed. 



69. Full-length PRO1108 Polypeptides 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1 108. In particular, Applicants have identified and isolated cDNA 
encoding a PRO1108 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA i^quence alignment computer programs, Applicants found that the PROl 108 polypeptide has significant 
similarity to the LPAAT protein. Accordingly, it is presently believed that PROl 108 polypeptide disclosed in 
the present application is a newly idemified LPAAT homolog. 



70. Full-length PROH37 Polypep tides 
The present invention provides newly idemifiedand isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 137. In particular, Applicants have identified and isolated cDNA 
encoding a PROl 137 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs, Applicants found that the PR01137 polypeptide has sequence 
identity with ribosyltransferases. Accordingly, it is presently believed that PROl 137 polypeptide disclosed in 
the present application is a newly identified member of the ribosyl transferase family and possesses 
ribosyltransferase activity. 



71. Full-length PROl 138 Polyp eptide 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 138. In particular. Applicants have identified and isolated cDNA 
encoding a PROl 138 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs. Applicants found that the PROl 138 polypeptide has sequence 
identity with CD84 leukocyte antigen. Accordingly, it is presently believed that PROl 138 polypeptide disclosed 
in the present application is a newly identified member of the Ig superfamily and has activity typical of other 
members of the Ig superfamily. 
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72. Full-length PRO1054 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1054. In particular, cDNA encoding a PRO 1054 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
5 native sequence PRO1054 (shown in Figure 174 and SEQ ID NO:256) has amino acid sequence identity with 
one or more of the major urinary proteins. Accordingly, it is presently believed that PRO 1054 disclosed in the 
present application is a newly identified member of the MUP family and may possess activity typical of that 
family. 

10 73. Full-length PRQ994 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0994. In particular, cDNA encoding a PR0994 polypeptide has been 
identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
15 native sequence PR0994 (shown in Figure 176 and SEQ ID N0:258) has amino-acid sequence identity with the 
tumor-associated antigen L6. Accordingly t it is presently believed that PR0994 disclosed in the present 
application is a newly identified L6 antigen homolog. 

74. Full-length PROS 12 Polypeptides 

20 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PR0812. In particular, cDNA encoding a PR0812 polypeptide has been 

identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 

native sequence PR0812 (shown in Figure 178 and SEQ ID NO:260) has ammo acid sequence identity with the 
25 prostatic steroid-binding cl protein. Accordingly, it is presently believed that PR0812 disclosed in the present 

application is a newly identified prostatic steroid-binding cl protein homolog. 

75. Full-length PRO 1069 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
30 referred to in the present application as PRO1069. In particular, Applicants have identified and isolated cDNA 
encoding a PRO 1069 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs, it was found that the PRO 1069 polypeptide has sequence identity 
with CHIP. Accordingly, it is presently believed that PRO 1069 polypeptide disclosed in the present application 
is a newly identified CHIF polypeptide and is involved in ion conductance or regulation of ion conductance. 

35 
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76. Full-length PRO 1129 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 129. In particular, Applicants have identified and isolated cDNA 
encoding a PROl 129 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs, Applicants found that the PROl 129 polypeptide has significant 
similarity to the cytochrome P-450 family of proteins. Accordingly, it is presently believed that PROl 129 
polypeptide disclosed in the present application is a newly identified member of the cytochrome P-450 family 
and possesses activity typical of that family. 

77. Full-length PROl 068 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1068. In particular, cDNA encoding a PRO 1068 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
se^uen^e PRO1068 has amino acid sequence identity with urotensin. Accordingly, it is presently believed that 
PRO 1068 disclosed in the present application is a newly identified member of the urotensin family and may 
possess activity typical of the urotensin family. 

78. Full-length PRO1066 Polypeptides 

The present invention provides newly identifiedand isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1066. In particular, Applicants have identified and isolated cDNA 
encoding a PRO 1066 polypeptide, as disclosed in further detail in the Examples below. The PRO1066-encoding 
clone was isolated from a human pancreatic tumor tissue library using a trapping technique which selects for 
nucleotide sequences encoding secreted proteins. Thus, the PRO1066-encoding clone may encode a secreted 
factor. To Applicants present knowledge, the DNA59215-1425 nucleotide sequence encodes a novel factor; 
using BLAST and FastA sequence alignment computer programs, no sequence identities to any known proteins 
were revealed. 

79. Full-length PROl 184 Polypeptides 

The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 184. In particular, Applicants have identified cDNA encoding a 
PROl 184 polypeptide, as disclosed in further detail in the Examples below. To Applicants present knowledge, 
the DNA59220-1514 nucleotide sequence encodes a novel secreted factor. 

80. FulMength PRO1360 Polypep tide 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1360. In particular, cDNA encoding a PRO1360 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
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As far as is known, the DNA59488-1603 sequence encodes a novel factor designated herein as 
PRO 13 60; using WU-BLAST2 sequence alignment computer programs, no significant sequence identities to any 
known proteins were revealed. Some sequence identities were revealed, as indicated below in the examples. 

81. Full-length PRO1029 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1029. In particular, cDNA encoding a PRO1029 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA59493-1420 clone was isolated from a human fetal liver spleen library using a trapping 
technique which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA59493-1420 clone 
does encode a secreted factor. As far as is known, the DNA59493-1420 sequence encodes a novel factor 
designated herein as PRO 1029; using the WU-BLAST2 sequence alignment computer program, no sequence 
identities to any known proteins were revealed. 

82. Full-length PRQ1139 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR01139. In particular, Applicants have identified and isolated cDNAs 
encoding PRO 1 139, as disclosed in further detail in the Examples below. Using BLAST and FasiA sequence 
alignment computer programs, Applicants found that the human PROl 139 protein originally identified exhibits 
a significant sequence homology to the a OB receptor associated protein HSOBRGRP1 , described by Baiileul 
et al.. Nucleic Acids Res. 25. 2752-2758 (1997) (EMBL Accession No: Y12670). 

83. Full-length PRO1309 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1309. In particular, cDNA encoding a PRO1309 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PRO 1309 (shown in Figure 196 and SEQ ID NO:278) has certain amino acid sequence identity 
with a protein designated KIAA0416, given the Dayhoff designation AB007876J. Moreover, PRO1309 has 
leucine rich repeats, accordingly, it is presently believed that PRO 1309 disclosed in the present application is 
a newly identified member of the leucine rich protein family and may be involved in protein protein interactions. 

84. Full-length PRO1028 Polyp eptide 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1028. In particular. Applicants have identified and isolated cDNA 
encoding a PRO1028 polypeptide, as disclosed in further detail in the Examples below. To Applicants present 
knowledge, the DNA59603-1419 nucleotide sequence encodes a novel factor. BLAST and FastA sequence 
alignment computer programs showed some sequence identity with proteins such as those designated " A53050" 
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85. Full-length PRO1027 Polypeptides 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1027. In particular, Applicants have identified and isolated cDNA 
encoding a PROl 027 polypeptide, as disclosed in further detail in the Examples below . The PRO 1 027-encoding 
clone was identified in a human uterine cervical tissue library. To Applicants present knowledge, the 
DNA59605-1418 nucleotide sequence encodes a novel factor. 



86. Full-length PROl 107 Polypeptides 

The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 107. In particular, Applicants have identified and isolated cDNA 
encoding a PROl 107 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs, Applicants found that the PROl 107 polypeptide has some 
similarity 'o ;he PC-1 protein, human insulin receptor tyrosine kinase inhibitor, an alkaline phosphodiesterase, 
and autotaxin. Accordingly, it is presently believed that PROl 107 polypeptide disclosed in the present 
application is a newly identified member of the phosphodiesterase family. 

87. Full-length PROl 140 Polypep tides 

The present inventionprovides newly identified and isolated nucleotide sequences encoding novel multi- 
span transmembrane polypeptides referred to in the present application as PROl 140. In particular, Applicants 
have identified and isolated cDNA encoding a PROl 140 polypeptide, as disclosed in further detail in the 
Examples below. Using BLAST and Fast A sequence alignment computer programs, some sequence identity with 
known proteins was found. 



88. Full-length PROl 106 Polyp eptide 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 106. In particular, Applicants have identified and isolated cDNA 
encoding a PROl 106 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs, Applicants found that the PRO1106 polypeptide has significant 
similarity to the peroxisomal calcium-dependent solute carrier. Accordingly, it is presently believed that 
PROl 106 polypeptide disclosed in the present application is a newly identified member of the mitochondrial 
carrier superfamily and possesses transporter activity typical of this family. 



89. Full-length P RQ1291 Polypep tide 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROI291 . In particular, cDNA encoding a PR01291 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
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Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PRO 1291 (shown in Figure 208 and SEQ ID NO:29l) has cenain amino acid sequence ideniity 
with the butyrophilin protein. Accordingly, it is presently believed that PR01291 disclosed in the present 
application is a newly identified butyrophilin bomolog and may possess activity typical of that protein. 

5 90. Full-length PRO1105 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 105. In particular, Applicants have identified cDNA encoding a 
PROl 105 polypeptide, as disclosed in further detail in the Examples below. To Applicants present knowledge, 
the DNA596I2-1466 nucleotide sequence encodes a novel factor. There is, however, some sequence identity 
10 with a peroxydase precursor designated in a Dayhoff database as "ATTS1623_1 ". 

91. Full-length PRQ511 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
refe~icd to h the present application as PR051 1. In particular, Applicants have identified and isolated cDNA 
15 encoding a PROS 11 polypeptide, as disclosed in further detail in the Examples below. The PR051 1 -encoding 
clone was isolated from a human colon tissue library. To Applicants present knowledge, the DNA59613-1417 
nucleotide sequence encodes a novel factor; using BLAST and FastA sequence alignment computer programs, 
sequence identities with RoBo-1, phospholipase inhibitors and a protein designated as "SSC20F10 1" were 
revealed, indicated that PR051 1 may be related to one or more of these proteins. 

20 

92. Full-length PROl 104 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 104. In particular, Applicants have identified and isolated cDNA 
encoding a PROl 104 polypeptide, as disclosed in further detail in the Examples below. To Applicants present 
25 knowledge, the DNA59616-1465 nucleotide sequence encodes a novel factor; using BLAST and FastA sequence 
alignment computer programs, some sequence identity appeared with proteins designated as "AB002107 1", 
"AF022991J" and "SP96_DlCDr. 

93. Full-length PROl 100 Polypeptides 

30 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PROl 100. In particular. Applicants have identified cDNA encoding a 
PROl 100 polypeptide, as disclosed in further detail in the Examples below. To Applicants present knowledge, 
the DNA59619-1464 nucleotide sequence encodes a novel factor; using BLAST and FastA sequence alignment 
computer programs, only some sequence identity with known proteins was revealed. There is some sequence 

35 identity with the yeast hypothetical 42.5 KD protein in TSM1-ARE1 intergenic region (ACCESSION 
NO: 140496), designated " YSCT4 YEAST" . 
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94. Full-length PRQ836 Polypeptides 

The present invention provides newly identified and isolaied nucleotide sequences encoding polypeptides 
referred to in the present application as PR0836. In particular. Applicants have identified and isolated cDNA 
encoding a PR0836 polypeptide, as disclosed in further detail in the Examples below. To Applicants present 
knowledge, the DNA59620-1463 nucleotide sequence encodes a novel factor. Using BLAST and FastA 
5 sequence alignment computer programs, there appears to be some sequence identity with SLSl . 

95. Full-length PRQ1141 Polypep tide 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 141. In particular, cDNA encoding a PR01141 polypeptide has 
10 been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA59625-1498 clone was isolaied from a human ileum tissue library. As far as is known, the 
DNA59625-1498 sequence encodes a novel factor designated herein as PROl 141; using the WU-BLAST2 
sequence "Jigmnent computer program, no sequence identities to any known proteins were revealed. 

15 96. Full-length PROl 1 32 Polyp eptide 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 132. In particular, cDNA encoding a PROl 132 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer program, it has been found that a full-length native 
20 sequence PROl 132 (shown in Figure 226 and SEQ ID NO:309) has certain amino acid sequence identity with 
enamel matrix serine proteinase 1 and neuropsin. Accordingly, it is presently believed that PROl 132 disclosed 
in the present application is a newly identified member of the serine protease family and may possess protease 
activity typical of this family. 

25 97. Full-length PRQ1346 Polyp eptide 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as NL7 (UNQ701). In particular, cDNA encoding an NL7 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

As disclosed in the Examples below, a clone DNA59776-1600 has been deposited with ATCC. The 
actual nucleotide sequence of the clone can be readily determined by the skilled artisan by sequencing of the 
deposited clone using routine methods in the an. The predicted amino acid sequence can be determined from 
the nucleotide sequence using routine skill. For the NL7 (PR01346) herein, Applicants have identified what 
is believed to be the reading frame best identifiable with the sequence information available at the time of filing. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
sequence NL7 (shown in Figure 228 and SEQ ID NO:314) has certain amino acid sequence identity with 
microfibril-associated glycoprotein 4 (MFA4_HUMAN);ficolin-A - Mus muscuius (AB007813_1); human lectin 
P35 (D63155S6J); ficohn B - Mus muscuius (AFO063217J); human tenascin-R (restriction) (HS518E13 I); 



) 
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the long form of a rat janusin precursor (A45445); fibrinogen-relaied protein HFREP-1 precursor (JN0596); 
a human Tenascin precursor (TENA HUMAN); human CDT6 (HSY16132_1); and angiopoietin- 1 - Mus 
musculus (MMU83509_1). It is presently beiieved that NL7 disclosed in the present application is a novel TIE 
ligand homologue, and may play a role in angiogenesis and/or vascular maintenance and/pr wound healing 
and/or inflammation and/or tumor development and/or growth 

5 

98. Full-length PRQ1131 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 131. In particular, cDNA encoding a PR01131 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
10 Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 

sequence PROl 131 (shown in Figure 230 and SEQ ID NO:319) has certain amino acid sequence identity with 
a lectin-like oxidized LDL receptor. Accordingly, it is presently believed that PRO 1131 disclosed in the present 
application r.iay have at least one mechanism similar to those of the LDL receptors. 

15 99. Full-length PRQ1281 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR01281. In particular, cDNA encoding a PR01281 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA59820-1549 clone was isolated from a human fetal liver library using a trapping technique 
20 which selects for nucleotide sequences encoding secreted proteins. Thus, as far as is known, the DNA59820- 
1549 sequence encodes a novel factor designated herein as PRO 1281 . Using WU-BLAST2 sequence alignment 
computer programs, some sequence identities to known proteins was found, but determined not to be significant. 

100. Full-length PROI064 Polypeptides 

25 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PRO 1064. In particular, cDNA encoding a PRO 1064 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA59827-1426 clone was isolated from a human fetal kidney library. As far as is known, the 
DNA59827-1426 sequence encodes a novel factor designated herein as PROI064; using the WU-BLAST2 

30 sequence alignment computer program, no significant sequence identities to any known proteins were revealed. 

101. Full-length PRQ1379 Polypep tide 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 379. In particular, cDNA encoding a PRO 1379 polypeptide has 
35 been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA59828 clone was isolated from a human fetal kidney library. As far as is known, the 
PRO 1379 polypeptide encoded thereby is a novel secreted factor. Using WU-BLAST2 sequence alignment 
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computer programs, sequence identity was found between PRO 1379 and a hypothetical yeast protein 
"YHY8_YEAST" (Dayhoff database; version 35.45 SwissProt 35), particularly at the C-tenninal ends. 
Sequence homologies with other known proteins were revealed, but determined not to be significant. 

102. Full-length PRQ844 Polypeptides 

5 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PR0844. In particular, Applicants have identified and isolated cDNA 
encoding a PR0844 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs, Applicants found that the PR0844 polypeptide has sequence identity 
with serine protease inhibitors. Accordingly, it is presently believed that PR0844 polypeptide disclosed in the 
10 present application is a newly identified serine protease inhibitor and is capable of inhibiting serine proteases. 

103. Full-length PRQ848 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
rtfcrred Li the present application as PRQ848. In particular, Applicants have identified and isolated cDNA 
15 encoding a PRO 848 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs, Applicants found that the PR0848 polypeptide has sequence identity 
with sialyltransferases. Accordingly, it is presently believed that PR0848 polypeptide disclosed in the present 
application is a newly identified member of the sialyltransferase family and possesses sialylation capabilities as 
typical of this family. 

20 

104. Full-length PRO1097 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred lo in the present application as PRO 1097. In particular, Applicants have identified and isolated cDNA 
encoding a PRO 1097 polypeptide, as disclosed in further detail in the Examples below. To Applicants present 
25 knowledge, the DNA5984 1-1460 nucleotide sequence encodes a novel factor. Using BLAST and FastA 
sequence alignment computer programs, some sequence identity with proteins designated as "CELK05G33", 
"CRU26344J-, "SPBCI6C6_8", "P_W 13844" and "AF013403" was revealed. 

105. Full-length PRQ1153 Polypeptide 

30 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PROl 153. In particular, cDNA encoding a PROl 153 polypeptide has 

been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 

sequence PROl 153 (shown in Figure 246 and SEQ ID NO:351) has certain amino acid sequence identity with 
35 HPBRII-7 protein submitted to the EMBL Data Library June 1992. Accordingly, it is presently believed that 

PROl 153 disclosed in the present application may be related to HPBRII-7. 
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106. Full-length PRQ1154 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 154. In particular, cDNA encoding a PROl 154 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
sequence PROl 154 (shown in Figure 248 and SEQ ID NO:353) aligns with a KIAA0525 protein, designated 
AB011097. PRO 11 54 has a novel N-terminus of 73 amino acids. Accordingly, PROl 154 is believed to be 
novel. PROl 154 also has significant sequence identity with aminopeptidase N, insulin-regulated membrane 
aminopeptidase, throtropin-releasing hormone degrading enzyme and placental leucine aminopeptidasc 
Therefore, PROl 154 is believed to be a novel aminopeptidase, or peptide which degrades peptides. 

107. Full-length PROl 181 Polypeptides 

The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 11 81. In particular, cDNA encoding a PRO 11 81 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

The DN A59847-15 1 1 clone was isolated from a human prostate tissue library using a trapping technique 
which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA59847-15II clone does 
encode a secreted factor. As far as is known, the DNA59847-151 1 sequence encodes a novel factor designated 
herein as PROl 181; using the WU-BLAST2 sequence alignment computer program, no significant sequence 
identities to any known proteins were revealed. 

108. Full-length PROl 182 Polypeptides 

The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 11 82. In particular, cDNA encoding a PROl 182 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PRO 11 82 (shown in Figure 252 and SEQ ID NO: 357) has amino acid sequence identity with 
the conglutinin protein. Accordingly, it is presently believed that PROl 182 disclosed in the present application 
is a newly identified conglutinin homolog. 

109. Full-length PRQ1155 Polypep tides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 155. In particular, cDNA encoding a PRO 11 55 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using VVU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
sequence PROl 155 (shown in Figure 254 and SEQ ID NO:359) has certain amino acid sequence identity with 
neurokinin B. Accordingly, it is presently believed that PROl 155 disclosed in the present application is a newly 
identified member of the tachykinin family. 
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110. Full-length PRQ1156 Polypeptides 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 156. In particular, cDNA encoding a PROl 156 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA59853-1505 clone was isolated from an adult human he an library using a trapping technique 
5 which selects for nucleotide sequences encoding secreted proteins . Thus, the DN A59853- 1 505 clone may encode 
a secreted factor. As far as is known, the DNA59853-1505 sequence encodes a novel factor designated herein 
as PROl 156. However, using WU-BLAST2 sequence alignment computer programs, some sequence identity 
with known proteins were revealed. 

10 111. Full-length PROl 098 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1098. In particular, Applicants have identified cDNA encoding a 
PRO1098 pol> peptide, as disclosed in further detail in the Examples below. The PRO 1098 -encoding clone was 
isolated from a human lung tissue library. To Applicants present knowledge, the DNA59854-1459 nucleotide 

1 5 sequence encodes a novel factor ; using BLAST and FastA sequence alignment computer programs , no significant 
sequence identities to any known proteins were revealed. Some sequence identity appeared with proteins such 
as the "Env" polyprotein and a methyltransferase. 

112. Full-length PRO 1127 Polypeptides 

20 The present invention provides newly idenufied and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PROl 127. In particular, cDNA encoding a PROl 127 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA60283-1484 clone encodes a secreted factor. As far as is known, the DNA60283-1484 
sequence encodes a novel factor designated herein as PROl 127; using WU-BLAST2 sequence alignment 

25 computer programs, minimal sequence identities to any known proteins were revealed. 

113. Full-length PRQ1126 Polypeptides 

The present invention provides newly idenufied and isolated nucleotidesequences encoding polypeptides 
referred to in the present application as PROl 126. In particular, cDNA encoding a PROl 126 polypeptide has 
30 been idenufied and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PROl 126 (shown in Figure 262 and SEQ ID N0.367) has certain amino acid sequence identity 
with the olfactomedin protein. Accordingly, it is presently believed that PROl 126 disclosed in the present 
application is a newly identified olfactomedin homolog and may possess activity typical of that protein. 

35 
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114. Full-length PRQ1125 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 125. In particular, cDNA encoding a PROl 125 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
sequence PROl 125 (shown in Figure 264 and SEQ ID NO:369) has certain amino acid sequence identity with 
transcriptional repressor rco-1. Accordingly, it is presently believed that PR01125 disclosed in the present 
application is a newly identified member of the WD superfamily. 

115. Full-length PROl 186 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 186. In particular, cDNA encoding a PROl 186 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
sequence PR01186 (shown in Figure 266 and SEQ ID NO:37I) has amino acid sequence identity with venom 
protein A from Dendroaspis polylepsis polylepsis venom. Accordingly, it is presently believed that PROl 186 
disclosed in the present application is a newly identified member of venom protein A and may share a related 
mechanism. 

116. Full-length PROl 198 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR01198. In particular, cDNA encoding a PROl 198 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

As far as is known, the DNA60622-1525 sequence encodes a novel factor designated herein as 
PRO 1 198. However, using WU-BLAST2 sequence alignment computer programs, some sequence identity with 
known proteins was found. 

117. Full-length PROl 158 Polypep tide 

The present invention provides newly identified and isolated nucleotidesequcnces encoding polypeptides 
referred to in the present application as PROl 158. In particular, cDNA encoding a PROl 158 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA60625-1507 clone was isolated from a human lung rumor tissue library. As far as is known, 
the DNA60625-1507 sequence encodes a novel factor designated herein as PRO 1158. However, using WU- 
BLAST2 sequence alignment computer programs, some sequence identities with known proteins were shown. 

118. Full-length PROl 159 Polyp eptide 

The present invention provides newly identifiedand isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROH59. In particular, cDNA encoding a PROl 159 polypeptide has 
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been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA60627-1508 clone was isolated from a human peripheral blood granulocyte tissue library using 
a trapping technique which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA60627- 
1508 clone does encode a secreted factor. As far as is known, the DNA60627-1508 sequence encodes a novel 
factor designated herein as PRO 1159; using the WU-BLAST2 sequence alignment computer program, no 
5 sequence identities to any known proteins were revealed. 

119. Full-length PRO 1124 Polypeptides 

The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 124. In particular, cDNA encoding a PRO 1 124 polypeptide has 
10 been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found thai a full-length native 
sequence PROl 124 (shown in Figure 274 and SEQ ID NO: 377) has amino acid sequence identity with an 
epithelial chloride channel protein from bos taurus. PROl 124 also has sequence identity with ECAM-1. 
Accordingly, it is presently believed that PROl 124 disclosed in the present application is a newly identified cell 
15 membrane protein involved in communication of cells either through ion channels or cell adhesion molecules. 

120. Full-length PRQ1287 Polypeptides 

The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1287. In particular, cDNA encoding a PRO 1287 polypeptide has 
20 been identified and isolated, as disclosed in further detail in the Examples below. 

Usmg.the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PR01287 (shown in Figure 276 and SEQ ID NO:381) has amino acid sequence identity with 
the radical fringe protein from Gallus gallus (GGU82088_1). Accordingly, it is presently believed that PRO 1287 
disclosed in the present application is a newly identified fringe protein homolog and may possess activity typical 
25 of the fringe protein. 

121. Full-length PRQ1312 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR01312. In particular, cDNA encoding a PR01312 polypeptide has 
30 been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, some sequence identities with known 
proteins were revealed, but were determined not to be significant. Therefore, as far as is known, the 
DNA6 1873- 1574 sequence encodes a novel transmembrane protein designated herein as PRO 13 12. 

35 122. Full-length PRQ1192 Polypeptides 

The present invention provides newly identifiedand isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 192. In particular, cDNA encoding a PROl 192 polypeptide has 
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been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
sequence PROl 192 {shown in Figure 280 and SEQ ID NO: 389) has amino acid sequence identity with trout P0- 
like glycoprotein (GEN 12838 IP1). Accordingly, it is presently believed that PRO 11 92 disclosed in the present 
application is a newly identified member of the myelin P0 glycoprotein family. 

5 

123. FulMength PROl 160 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1160. In particular, cDNA encoding a PROl 160 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
10 The DN A62872- 1509 clone was isolated from a human breast tissue library using a trapping technique 

which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA62872-1509 clone does 
encode a secreted factor. As far as is known, the DNA62872-1509 sequence encodes a novel factor designated 
herein as PROl 160; using the WU-BLAST2 sequence alignment computer program, no significant sequence 
io> rtv.ts to any known proteins were revealed. 

15 

124. Full-length PRQ1187 Polypeptides 

The present invention provides newly identifiedand isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 187. In particular, cDNA encoding a PROl 187 polypeptide has 
been identified and isolated, as disclosed in funher detail in the Examples below. 
20 As far as is known, the DNA62876-1517 sequence encodes a novel factor designated herein as 

PRO! 187; using WU-BLAST2 sequence alignment computer programs, no significant sequence identities to any 
known proteins were revealed. 

125. Full-length PRQ1185 Polypeptides 

25 The present invemionprovidesnewly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PROl 185. In particular, cDNA encoding a PROl 185 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

As far as is known, the DNA62881- 15 15 clone encodes a novel factor designated herein as PROl 185; 
using WU-BLAST2 sequence alignment computer programs, no significant sequence identities to any known 

30 proteins were revealed. 

126. Full-length PROl 345 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1345. In particular, cDNA encoding a PRO 1345 polypeptide has 
35 been identified and isolated, as disclosed in funher detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PR01345 (shown in Figure 288 and SEQ ID NO:403) has amino acid sequence identity with 
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Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PRO1270 (shown in Figure 296 and SEQ ID NO:414) has amino acid sequence identity with 
the lectin protein (XLU86699 1) of Xenopus laevis. Accordingly, it is presently believed thai PRO 1270 
disclosed in the present application is a newly identified member of the lectin protein family and may possess 
activity typical of that family. 

5 

131. Full-length PRQ1271 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1271, In particular, cDNA encoding a PR01271 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
10 As far as is known, the DNA66309-1538 sequence encodes a novel factor designated herein as 

PRO 127 1 ; using WU-BLAST2 sequence alignment computer programs, no significant sequence identities to any 
known proteins were revealed (results further described in the examples below). 

132. Full-length PRQ1375 Polypeptides 

1 5 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PRO 1375. In particular, cDNA encoding a PRO 1375 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
sequence PR01375 (shown in Figure 300 and SEQ ID NO:418) has amino acid sequence identity PUT2. 

20 Accordingly, it is presently believed that PR01375 disclosed in the present application has at least one related 
mechanism of PUT2. 

133. Full-length PRQ1385 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
25 referred to in the present application as PR01385. In particular, cDNA encoding a PR01385 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA68869-1610 clone was isolated from a human tissue library using a trapping technique which 
selects for nucleotide sequences encoding secreted proteins. Thus, the DNA68869-1610 clone does encode a 
secreted factor. As far as is known, the DNA68869-1610 sequence encodes a novel factor designated herein as 
30 PRO 13 85; using the WU-BLAST2 sequence alignment computer program, no significant sequence identities to 
any known proteins were revealed. 

134. Full-length PRQ1387 Polypeptide 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
35 referred to in the present application as PROB87. In particular, cDNA encoding a PR01387 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
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Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PRO 1387 (shown in Figure 304 and SEQ ID NO:422) has amino acid sequence identity with 
the myelin pO protein protein precursor (MYPO_HETFR). Accordingly, it is presently believed that PRO 1387 
disclosed in the present application is a newly identified member of the myelin protein family and may possess 
activity typical of that family. 

5 

135. Full-length PRQ1384 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1384. In particular, cDNA encoding a PRO 1384 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
1 0 Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 

sequence PRO 13 84 (shown in Figure 306 and SEQ ID NO:424) has amino acid sequence identity with NKG2-D 
(AF054819_1; Dayhoff database, version 35.45 SwissProt 35). Accordingly, it is presently believed that 
PRO 1384 disclosed in the present application is a newly identified member of the NKG2 family and may possess 
MHC activation/inactivation activities typical of the NKG2 family. 

15 

B. PRO Variants 

In addition to the full-length native sequence PRO polypeptides described herein, it is contemplated that 
PRO variants can be prepared. PRO variants can be prepared by introducing appropriate nucleotide changes into 
the PRO DNA, and/or by synthesis of the desired PRO polypeptide. Those skilled in the art will appreciate that 

20 amino acid changes may alter post-translational processes of the PRO, such as changing the number or position 
of glycosylation sites or altering the membrane anchoring characteristics. 

Variations in the native full-length sequence PRO or in various domains of the PRO described herein, 
can be made, for example, using any of the techniques and guidelines for conservative and non-conservative 
mutations set forth, for instance, in U.S. Patent No. 5,364,934. Variations may be a substitution, deletion or 

25 insertion of one or more codons encoding the PRO that results in a change in the amino acid sequence of the 
PRO as compared with the native sequence PRO. Optionally the variation is by substitution of at least one amino 
acid with any other amino acid in one or more of the domains of the PRO. Guidance in determining which 
amino acid residue may be inserted, substituted or deleted without adversely affecting the desired activity may 
be found by comparing the sequence of the PRO with that of homologous known protein molecules and 

30 minimizing the number of amino acid sequence changes made in regions of high homology. Amino acid 
substitutions can be the result of replacing one amino acid with another amino acid having similar structural 
and/or chemical properties, such as the replacement of a leucine with a serine, i.e., conservative amino acid 
replacements. Insertions or deletions may optionally be in the range of about 1 to 5 amino acids. The variation 
allowed may be determined by systematically making insertions, deletions or substitutions of amino acids in the 

35 sequence and testing the resulting variants for activity exhibited by the full-length or mature native sequence. 

PRO polypeptide fragments are provided herein. Such fragments may be truncated at the N-terminus 
or C-terminus, or may lack internal residues, for example, when compared with a full length native protein. 
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Certain fragments lack amino acid residues that are not essentia] for a desired biological activity of the PRO 
polypeptide. 

PRO fragments may be prepared by any of a number of conventional techniques. Desired peptide 
fragments may be chemically synthesized. An alternative approach involves generating PRO fragments by 
enzymatic digestion, e.g., by treating the protein with an enzyme known to cleave proteins at sites defined by 
5 particular amino acid residues, or by digesting the DNA with suitable restriction enzymes and isolating the 
desired fragment. Yet another suitable technique involves isolating and amplifying a DNA fragment encoding 
a desired polypeptide fragment, by polymerase chain reaction (PCR). Oligonucleotides that define the desired 
termini of the DNA fragment art employed at the 5' and 3' primers in the PCR. Preferably, PRO polypeptide 
fragments share at least one biological and/or immunological activity with the native PRO polypeptide disclosed 
10 herein. 

In particular embodiments, conservative substitutions of interest are shown in Table 1 under the heading 
of preferred subsututions. If such subsututions result in a change in biological activity, then more substantial 
changes, denominated exemplary substitutions in Table 1, or as further described below in reference to amino 
acid classes, are introduced and the products screened. 
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Table 1 





Original 


Exemplary 


Preferred 




Residue 


Substitutions 


Substitutions 


5 


Ala (A) 


val; leu; ile 


val 




Arg (R) 


lys; gin; asn 


lys 




Asn (N) 


gin; his; lys; arg 


gin 




Asp (D) 


glu 


glu 




Cys (C) 


ser 


scr 


10 


Gin (Q) 


asn 


asn 




Glu (E) 


asp 


asp 




Gly (G) 


pro; ala 


ala 




His (H) 


asn; gin; lys; arg 


arg 




lie (I) 


leu; val; met; ala; phe; 




15 




norleucine 


leu 




Leu(L) 


norleucine; Ue; val; 








met; ala; phe 


ile 




Lys (K) 


arg; gin; asn 


arg 




Met (M) 


leu; phc; ile 


leu 


20 


Phe (F) 


leu; val; ile; ala; tyr 


leu 




Pro(P) 


ala 


ala 




Ser (S) 


thr 


thr 




Thr (T) 


ser 


scr 




Trp (W) 


tyr; phe 


tyr 


25 


Tyr (Y) 


trp; phe; thr; ser 


phe 




Val (V) 


ile; leu; met; phe; 






ala; norleucine 


leu 



Substantial modifications in functionor immunologicalidentuy of the PRO polypeptide are accomplished 
30 by selecting substitutions that differ significantly in their effect on maintaining (a) the structure of the polypeptide 

backbone in the area of the substitution, for example, as a sheet or helical conformation, (b) the charge or 

hyorophobicity of the molecule at the target site, or (c) the bulk of the side chain. Naturally occurring residues 

are divided into groups based on common side-chain properties: 

(1) hydrophobic: norleucine, met, ala, val, leu, ile; 
35 (2) neutral hydrophilic: cys, scr, thr; 

(3) acidic; asp, glu; 

(4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp, tyr, phe. 

40 Non-conservative substitutions will entail exchanging a member of one of these classes for another class. 

Such substituted residues also may be imroduced into the conservative substitution sites or, more preferably, into 
the remaining (non-conserved) sites. 

The variations can be made using methods known in the art such as oiigonudeotide-mediated (site- 
directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis [Carter et aL, Nucl. 

45 Acids Res., 13:4331 (1986); Zoller et ah, Nucl. Acids Res.. 10:6487 (1987)), cassette mutagenesis [Wells et 
al.. Gene, 34:315 (1985)], restriction selection mutagenesis [Wells et al., Philos. Trans. R. Soc. London ScrA 
3T7:415 (1986)] or other known techniques can be performed on the cloned DNA to produce the PRO variant 
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DNA. 

Scanning amino acid analysis can also be employed to identify one or more amino acids along a 
contiguous sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids. Such 
amino acids include alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning amino acid 
among this group because it eliminates the side-chain beyond the beta-carbon and is less likely to alter the main- 
5 chain conformation of the variant [Oinningbam and Wells, Science. 244: 1081-1085 (1989)]. Alanine is also 
typically preferred because it is the most common amino acid. Further, it is frequently found in both buried and 
exposed positions [Creighton, The Proteins. (W.H. Freeman & Co., N.Y.); Chothia, J. Mol. Biol.. 150 :1 
(1976)1 If alanine substitution does not yield adequate amounts of variant, an isoteric amino acid can be used. 

10 C. Modifications of PRO 

Covalent modifications of PRO are included within the scope of this invention. One type of covalent 
modification includes reacting targeted amino acid residues of a PRO polypeptide with an organic derivatizing 
agent that is capable of reacting with selected side chains or the N- or C- terminal residues of the PRO. 
Derivatization with bi functional agents is useful, for instance, for crosslinking PRO to a water-insoluble support 
15 matrix or surface for use in the method for purifying anti-PRO antibodies, and vice-versa. Commonly used 
crosslinking agents include, e.g., lJ-bis(diazoacetyl)-2-pnenyiethane, glutaraldehyde, N-hydroxysuccinimide 
esters, for example, esters with 4-azidosaIicylic acid, homobi functional imidoesters, including disuccinimidyl 
esters such as 3,3 , -dithiobis(succinimidylpropionate), Afunctional maleimides such as bis-N-maleimido-1,8- 
octane and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl residues to the corresponding 
glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, phosphorylation of hydroxyl 
groups of seryl or threonyl residues, methylation of the ce-amino groups of lysine, arginme, and histidine side 
chains [T.E. Creighton, Proteins: Structure and Molecular Properties. W.H. Freeman & Co. f San Francisco, 
pp. 79-86 (1983)], acerylation of the N-terminal amine, and amidation of any C-terrainal carboxyl group. 

Another type of covalent modification of the PRO polypeptide included within the scope of this 
invention comprises altering the native glycosylation pattern of the polypeptide. "Altering the native 
giycosylation pattern" is intended for purposes herein to mean deleting one or more carbohydrate moieties found 
in native sequence PRO (either by removing the underlying glycosylation site or by deleting the glycosylation 
by chemical and/or enzymatic means), and/or adding one or more glycosylation sites that are not present in the 
native sequence PRO. In addition, the phrase includes qualitative changes in the glycosylation of the native 
proteins, involving a change in the nature and proportions of the various carbohydrate moieties present. 

Addition of giycosylation sites to the PRO polypeptide may be accomplished by altering the amino acid 
sequence. The alteration may be made, for example, by the addition of, or substitution by, one or more serine 
or threonine residues to the native sequence PRO (for O-linked glycosylation sites). The PRO amino acid 
sequence may optionally be altered through changes at the DNA level, particularly by mutating the DNA 
encoding the PRO polypeptide at preselected bases such that codons are generated that will translate into the 
desired amino acids. 
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Another means of increasing the number of carbohydrate moieties on the PRO polypeptide is by 
chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described in the art, e.g., 
in WO 87/05330 published 11 September 1987, and in Aplin and Wriston, CRC Crit. Rev. Biochem., pp. 259- 
306 (1981). 

Removal of carbohydrate moieties present on the PRO polypeptide may be accomplished chemically 
5 or enzymatically or by mutational substitution of codons encoding for amino acid residues that serve as targets 
for glycosylation. Chemical deglycosylation techniques are known in the art and described, for instance, by 
Hakimuddin, et al.. Arch. Biochem. Biophvs.. 259:52 (1987) and by Edge et al., Anal. Biochem., 118:131 
(1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety 
of endo- and exo-glycosidases as described by Thotakura et al., Meth. EnzvmoU 138i350 (1987). 

10 Another type of covalent modification of PRO comprises linking the PRO polypeptide to one of a variety 

of nonproteinaceous polymers, e.g., polyethylene glycol (PEG), polypropylene glycol, or polyoxyalkylenes, in 
the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or4, 179,337. 

The PRO of the present invention may also be modified in a way to form a chimeric molecule 
comprising PRO fused to another, heterologous polypeptide or amino acid sequence. 

15 in one embodiment, such a chimeric molecule comprises a fusion of the PRO with a tag polypeptide 

which provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is generally placed 
at the amino- or carboxyl- terminus of the PRO. The presence of such epitope- tagged forms of the PRO can be 
detected using an antibody against the tag polypeptide. Also, provision of the epitope tag enables the PRO to 
be readily purified by affinity purification using an anti-tag antibody or another type of affinity matrix that binds 

20 to the epitope tag. Various tag polypeptides and their respective antibodies are well known in the art. Examples 
include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu HA tag polypeptide and its 
antibody 12CA5 [Field et a!., Mol. Cell. Biol.. 8:2159-2165 (1988)]; the c-myc tag and the 8F9, 3C7, 6E10, 
G4, B7 and 9E10 antibodies thereto [Evan et al., Molecular and Cellular Biology. 5;3610-3616 (1985)]; and the 
Herpes Simplex virus glycoprotein D (gD) tag and its antibody fPaborsky et al.. Protein Engineering. 3(6):547- 

25 553 (1990)]. Other tag polypeptides include the Flag-peptide [Hopp et al., BioTechnoloev. £: 1204-1210 
(1988)]; the KT3 epitope peptide [Martin et al., Science . 2^:192-194 (1992)]; an cc-tubulin epitope peptide 
[Skinner et al., J. Biol. Chem. . 26£: 15163-15166 (1991)]; and the T7 gene 10 protein peptide tag [Lutz- 
Freyermuth et al., Proc. Natl. Acad. Sci. USA. 87:6393-6397 (1990)]. 

In an alternative embodiment, the chimeric molecule may comprise a fusion of the PRO with an 

30 immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the chimeric molecule (also 
referred to as an "immunoadhesin"), such a fusion could be to the Fc region of an IgG molecule. The Ig fusions 
preferably include the substitution of a soluble (transmembrane domain deleted or inactivated) form of a PRO 
polypeptide in place of at least one variable region within an Ig molecule. In a particularly preferred 
embodiment, the immunoglobulin fusion includes the hinge, CH2 and CH3, or the hinge, CHI, CH2 and CH3 

35 regions of an IgGl molecule. For the production of immunoglobulin fusions see also US Patent No. 5,428,130 
issued June 27, 1995. 
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D. Preparation of PRO 

The description below relates primarily to production of PRO by culruring cells transformed or 
transfected with a vector containing PRO nucleic acid. It is. of course, contemplated that alternative methods, 
which are well known in the art, may be employed to prepaje PRO. For instance, the PRO sequence, or 
portions thereof, may be produced by direct peptide synthesis using solid-phase techniques [see, e.g., Stewart 
5 et al Solid-Phase Peptide Synthesis. W.H. Freeman Co., San Francisco, CA (1969); Merrifield. J. Am.Chem. 
Soc. . 85:2149-2154 (1963)]. In vitro protein synthesis may be performed using manual techniques or by 
automation. Automated synthesis may be accomplished, for instance, using an Applied Biosy stems Peptide 
Synthesizer (Foster City, CA) using manufacturer's instructions. Various portions of the PRO may be 
chemically synthesized separately and combined using chemical or enzymatic methods to produce the full-length 
10 PRO. 

1. Isolation of DNA Encoding PRO 
DNA encoding PRO may be obtained from a cDNA library prepared from tissue believed to possess 
the PRO mRNA and to express it at a detectable level. Accordingly, human PRO DNA can be conveniently 
15 obtained from a cDNA library prepared from human tissue, such as described in the Examples. The PRO- 
encoding gene may also be obtained from a genomic library or by known synthetic procedures (e.g., automated 
nucleic acid synthesis). 

Libraries can be screened with probes (such as antibodies to the PRO or oligonucleotides of at least 
about 20-80 bases) designed to identify the gene of interest or the protein encoded by it. Screening the cDNA 

20 or genomic library with the selected probe may be conducted using standard procedures, such as described in 
Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 
1989). An alternative means to isolate the gene encoding PRO is to use PCR methodology [Sambrook et al., 
supra : Dieffcnbach et al., PCR Primer: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 1995)]. 
The Examples below describe techniques for screening a cDNA library. The oligonucleotide sequences 

25 selected as probes should be of sufficient length and sufficiently unambiguous that false positives are minimized. 
The oligonucleotide is preferably labeled such that it can be detected upon hybridization to DNA in the library 
being screened. Methods of labeling are well known in the an, and include the use of radiolabels like 3: P-labeled 
ATP, biotinylation or enzyme labeling. Hybridization conditions, including moderate stringency and high 
stringency, are provided in Sambrook et al., supra. 

30 Sequences identified in such library screening methods can be compared and aligned to other known 

sequences deposited and available in public databases such as GenBank or other private sequence databases. 
Sequence identity (at either the amino acid or nucleotide level) within defined regions of the molecule or across 
the full-length sequence can be determined using methods known in the art and as described herein. 

Nucleic acid having protein coding sequence may be obtained by screening selected cDNA or genomic 

35 libraries using the deduced amino acid sequence disclosed herein for the first time, and, if necessary, using 
conventional primer extension procedures as described in Sambrook et al., supra , to detect precursors and 
processing intermediates of mRNA that may not have been reverse- transcribed into cDNA. 
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2. Selection and Transformation of Host Cells 
Host cells are transfected or transformed with expression or cloning vectors described herein for PRO 
production and cultured in conventional nutrient media modified as appropriate for inducing promoters, selecting 
trans formants, or amplifying the genes encoding the desired sequences. The culture conditions, such as media, 
temperature, pH and the like, can be selected by the skilled artisan without undue experimentation. In general, 
5 principles, protocols, and practical techniques for maximizing the productivity of cell cultures can be found in 
Mammalian Cell Biotechnology: a Practical Approach. M. Butler, ed. (IRL Press, 1991) and Sambrook et al.. 
supra . 

Methods of eukaryotic cell transfection and prokaryotic cell transformation are known to the ordinarily 
skilled artisan, for example, CaCl 2 , CaP0 4 , liposome -mediated and electroporation. Depending on the host cell 

10 used, transformation is performed using standard techniques appropriate to such cells. The calcium treatment 
employing calcium chloride, as described in Sambrook et al., supra , or electroporation is generally used for 
prokaryotes. Infection with Agrobacterium tumefaciens is used for transformation of certain plant cells, as 
described by Shaw et al. , Gene . 23:315 (1983) and WO 89/05859 published 29 June 1989. For mammalian cells 
without such eel! walls, the calcium phosphate precipitation method of Graham and van der Eb, Virology . 

15 52:456-457 (1978) can be employed. General aspects of mammalian cell host system transections have been 
described in U.S. Patent No. 4,399,216. Transformations into yeast are typically carried out according to the 
method of Van Solingenetal., J. Bact. . 130:946(1977) and Hsiao et al., Proc. Natl. Acad. Sci. (USA) . 76:3829 
(1979). However, other methods for introducing DNA inio cells, such as by nuclear microinjection, 
electroporation, bacterial protoplast fusion with intact cells, or polycations, e.g., polybrene, polyomithine, may 

20 also be used. For various techniques for traroforming mammalian cells, see Keown et al., Methods in 
Enzvmology. 185:527-537 (1990) and Mansour et al., Nature. 336:348-352 (1988). 

Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote, yeast, 
or higher eukaryote cells. Suitable prokaryotes include but are not limited to eubacteria, such as Gram-negative 
or Gram-positive organisms, for example, Enterobacteriaceae such as E. coli. Various E. coli strains are 

25 publicly available, such as E. coli K12 strain MM294 (ATCC 31,446); £. coli XI 776 (ATCC 31,537); E. coli 
strain W3110 (ATCC 27,325) and K5 772 (ATCC 53,635). Other suitable prokaryotic host cells include 
Enterobacteriaceae such as Escherichia, e.g., E. coli, Enterobacter \ Erwinia, Klebsiella, Proteus, Salmonella, 
e.g., Salmonella ryphimurium, Serratia, e.g., Serratia marcescans, and Shigella, as well as Bacilli such as B. 
subtilis and B. licheniformis (e.g., B, licheniformis 41P disclosed in DD 266,710 published 12 April 1989), 

30 Pseudomonas such as P. aeruginosa, and Streptomyces . These examples are illustrative rather than limiting. 
Strain W3 1 10 is one particularly preferred host or parent host because it is a common host strain for recombinant 
DNA product fermentations. Preferably, the host cell secretes minimal amounts of proteolytic enzymes. For 
example, strain W31 10 may be modified to effect a genetic mutation in the genes encoding proteins endogenous 
to the host, with examples of such hosts including E. coli W31 10 strain 1 A2, which has the complete genotype 
35 tonA ; E. coli W3110 strain 9E4, which has the complete genotype tonA prr3; E. coli W3110 strain 27C7 
(ATCC 55,244), which has the complete genotype tonA ptr3 phoA El 5 (argF-lac)169degP ompTkarf; E. coli 
W31 10 strain 37D6, which has the complete genotype wnA prr3 phoA E15 (argF-lac)169 degP ompT rbs7 
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il\G kaif; E. coli W3110 strain 40B4, which is strain 37D6 with a non-kanamycin resistant degP deletion 
mutation; and an E. coli strain having mutant penplasmic protease disclosed in U.S. Patent No. 4,946,783 issued 
7 August 1990. Alternatively, in vitro methods of cloning, e.g . , PCR or other nucleic acid polymerase reactions, 
are suitable. 

In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable cloning 
5 or expression hosts for PRO-encoding vectors. Saccharomyces cerevisiae is a commonly used lower eukaryotic 
host microorgamsm. Others include Schizosaccharomyces pombe (Beach and Nurse, Nature . 290: 140 [1981]; 
EP 139,383 published 2 May 1985); Kluyveromyces hosts (U.S. Patent No. 4,943,529; Fleer et al., 
Bio/Technology. 9:968-975 (1991)) such as, e.g., K. laais (MW98-8C, CBS683, CBS4574; Louvencourt et al., 
J. Bacterid. . 737 [1983]), K. fragilis (ATCC 12,424), K. bulgaricus (ATCC 16,045), K. wickeramii (ATCC 

10 24,178), K. walrii (ATCC 56,500), K. drosophilarum (ATCC 36,906; Van den Berg et al., Bio/Technology. 
8:135 (1990)), K. thermotolerans \ and K. mancianus; yarrowia (EP 402,226); Pichia pastoris (EP 183,070; 
Sreekrishna et al., J. Basic Microbiol.. 28:265-278 (1988J); Candida; Trichoderma reesia (EP 244,234); 
Neurospora crassa (Case et al., Proc. Natl. Acad. Sci. USA. 76:5259-5263 [1979]); Schwanniomyces such as 
Schwanniomyces occidenialis (EP 394,538 published 31 October 1990); and filamentous fungi such as, e.g., 

15 Neurospora, Penicillium, Toiypocladium (WO 91/00357 published 10 January 1991), and Aspergillus hosts such 
as A niduUms (Ballance et al. t Biochem. Biophvs. Res. Commun.. 112:284-289 [1983]; Tilbum et aJ., Gene . 
26:205-221 [1983]; Yelton et al., Proc. Natl. Acad. Sci. USA . 81 : 1470-1474 [1984]) and A. niger (Kelly and 
Hynes, EMBO J. . 4:475-479 [1985]). Methylotropic yeasts are suitable herein and include, but are not limited 
to, yeast capable of growth on methanol selected from the genera consisting of Hansenula, Candida, Kheckera, 

20 Pichia, Saccharomyces, Tontlopsis, and Rhodotorula. A list of specific species that are exemplary of this class 
of yeasts may be found in C. Anthony, The Biochemistry of Methvlotrophs. 269 (1982). 

Suitable host cells for the expression of glycosylated PRO are derived from multicellular organisms. 
Examples of invertebrate cells include insect cells such as Drosophila S2 and Spodoptera Sf9, as well as plant 
cells. Examples of useful mammalian host cell lines include Chinese hamster ovary (CHO) and COS cells. 

25 More specific examples include monkey kidney CV1 line transformed by SV40 (COS-7, ATCC CRL 1651); 
human embryonic kidney line (293 or 293 cells subcloned for growth in suspension culture, Graham et al., J._ 
Gen Virol. . 36:59 (1977)); Chinese hamster ovary cells/-DHFR (CHO, Uriaub and Chasin, Proc. Natl. Acad. 
Sci. USA . 77:4216 (1980)); mouse Sertoli cells (TM4, Mather, Biol. Reprod.. 23:243-251 (1980)); human lung 
cells (W138, ATCC CCL 75); human liver cells (Hep G2. HB 8065); and mouse mammary tumor (MMT 

30 060562, ATCC CCL51). The selection of the appropriate host cell is deemed to be within the skill in the art. 

3. Selection and Use of a Replicable Vector 
The nucleic acid (e.g., cDNA or genomic DNA) encoding PRO may be inserted into a replicable vector 
for cloning (amplification of the DNA) or for expression. Various vectors are publicly available. The vector 
35 may, for example, be in the form of a plasmid, cos mid, viral panicle, or phage. The appropriate nucleic acid 
sequence may be inserted into the vector by a variety of procedures. In general, DNA is inserted into an 
appropriate restriction endonuclease site(s) using techniques known in the an. Vector components generally 
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include, but are not limited to, one or more of a signal sequence, an origin of replication, one or more marker 
genes, an enhancer element, a promoter, and a transcription termination sequence. Construction of suitable 
vectors containing one or more of these components employs standard ligation techniques which are known to 
the skilled artisan. 

The PRO may be produced recombinantly not only directly, but also as a fusion polypeptide with a 
5 heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage site 
at the N-terrniims of the mature protein or polypeptide. In general, the signal sequence may be a component of 
the vector, or it may be a pan of the PRO-encoding DNA that is inserted into the vector. The signal sequence 
may be a prokaryotic signal sequence selected, for example, from the group of the alkaline phosphatase, 
penicillinase, lpp. or heat-stable emerotoxin II leaders. For yeast secretion the signal sequence may be, e.g., 

10 the yeast invertase leader, alpha factor leader (including Saccharomyces and Kluyveromyces a-factor leaders, 
the latter described in U.S. Patent No. 5,010,182), or acid phosphatase leader, the C albicans glucoamylase 
leader (EP 362,179 published 4 April 1990), or the signal described in WO 90/13646 published 15 November 
1990. In mammalian cell expression, mammalian signal sequences may be used to direct secretion of the 
protein, such as signal sequences from secreted polypeptides of the same or related species, as well as viral 

15 secretory leaders. 

Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate 
in one or more selected host cells. Such sequences arc well known for a variety of bacteria, yeast, and viruses. 
The origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2ft plasmid 
origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful for 
20 cloning vectors in mammalian cells. 

Expression and cloning vectors will typically contain a selection gene, also termed a selectable marker. 
Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g., ampicillin, 
neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply critical nutrients 
not available from complex media, e.g., the gene encoding D-alanine racemase for Bacilli. 
25 An example of suitable selectable markers for mammalian cells are those that enable the identification 

of cells competent to take up the PRO-encoding nucleic acid, such as DHFR or thymidine kinase. An 
appropriate host cell when wild-type DHFR is employed is the .CHO cell line deficient in DHFR activity, 
prepared and propagated as described by Urlaub et ah , Proc. Natl. Acad .Sri USA 77:4216 (1980). A suitable 
selection gene for use in yeast is the trpl gene present in the yeast plasmid YRp7 [Stinchcomb et al., Nature . 
30 282:39 (1979); Kingsmanet al., Gene. 7:141 (1979); Tschemper ei al., Gene. 10:157 (1980)]. The trpl gene 
provides a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan, for example. 
ATCC No. 44076 or PEP4-1 [Jones, Genetics. 85:12 (1977)]. 

Expression and cloning vectors usually contain a promoter operably linked to the PRO-encoding nucleic 
acid sequence to direct mRNA synthesis. Promoters recognized by a variety of potential host cells are well 
35 known. Promoters suitable for use with prokaryotic hosts include the ^-lactamase and lactose promoter systems 
[Chang et al., Nature, 275:615 (1978); Goeddel et al., Nature, 281:544 (1979)], alkaline phosphatase, a 
tryptophan (trp) promoter system [Goeddel, Nucleic Acifk P.c 8 :4057 (1980); EP 36,776], and hybrid 
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promoters such as the tac promoter [deBoer et aJ., Proc. Natl. Acad. Sci. USA. 80:21-25 (1983)]. Promoters 
for use in bacterial systems also will contain a Shine-Daigamo (S.D.) sequence operably linked to the DNA 
encoding PRO. 

Examples of suitable promoting sequences for use with yeast hosts include the promoters for 3- 
phosphoglycerate kinase [Hitzeman et ah, J. Biol. Chem.. 255:2073 (1980)] or other glycolytic enzymes [Hess 
et al. p J. Adv. Enzyme Re£., 7:149 (1968); Holland, Biochemistry. 17:4900 (1978)], such as enolase, 
glyceraldehyde-3-phosphate dehydrogenase, hexoldnase,pyruvatedecarboxylase,phosphon^ctokinase,glucose- 
6-phosphate isomerase, 3-phosphogly cerate mutase, pyruvate kinase, triosephosphateisomerase,phosphoglucosc 
isomerase, and glucokinase. 

Other yeast promoters, which are inducible promoters having the additional advantage of transcription 
controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocyiochrome C, acid 
phosphatase, degradative enzymes associated with nitrogen metabolism, metaJlothionein, glyceraldehyde-3- 
phosphate dehydrogenase; and enzymes responsible for maltose and galactose utilization. Suitable vectors and 
promoters for use in yeast expression are further described in EP 73,657. 

PRO transcription from vectors in mammalian host cells is controlled, for example, by promoters 
obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,21 1,504 published 5 July 
1989), adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a 
retrovirus, hepatitis-B virus and Simian Virus 40 (SV40), from heterologous mammalian promoters, e.g., the 
actin promoter or an immunoglobulin promoter, and from heat-shock promoters, provided such promoters are 
compatible with the host cell systems. 

Transcription of a DNA encoding the PRO by higher eukaryotes may be increased by inserting an 
enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 10 to 300 
bp, that act on a promoter to increase its transcription. Many enhancer sequences are now known from 
mammalian genes (globin, elastase, albumin, cc-fetoprotein, and insulin). Typically, however, one will use an 
enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the late side of the replication 
origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the 
replication origin, and adenovirus enhancers. The enhancer may be spliced into the vector at a position 5' or 
3' to the PRO coding sequence, but is preferably located at a site 5' from the promoter. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated 
cells from other multicellular organisms) will also contain sequences necessary for the termination of 
transcription and for stabilizing the mRNA. Such sequences are commonly available from the 5' and, 
occasionally 3 untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide 
segments transcribed as polyadenylated fragments in the untranslated portion of the mRNA encoding PRO. 

Still other methods, vectors, and host cells suitable for adaptation to the synthesis of PRO in 
recombinant vertebrate cell culture are described in Gethuig et al., Nature . 293:620-625 (1981); Mantei et al. ( 
Nature . 281:40-46 (1979); EP 117,060; and EP 117,058. 
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4. Detectine Gene Amplification/Expression 

Gene amplification and/or expression may be measured in a sample directly, for example, by 
conventional Southern blotting, Northern blotting to quamitate the transcription of mRNA [Thomas, Proc. Nad. 
Acad. Sci. USA . 77:5201-5205 (1980)], dot blotting (DNA analysis), or in situ hybridization, using an 
appropriately labeled probe, based on the sequences provided herein. Alternatively, antibodies may be employed 
5 that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes 
or DNA-protein duplexes. The antibodies in rum may be labeled and the assay may be carried out where the 
duplex is bound to a surface, so that upon the formation of duplex on the surface, the presence of antibody bound 
to the duplex can be detected. 

Gene expression, alternatively, may be measured by immunological methods, such as 

10 immunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, to quamitate 
directly the expression of gene product. Antibodies useful for immunohistochemical staining and/or assay of 
sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, the 
antibodies may be prepared against a native sequence PRO polypeptide or against a synthetic peptide based on 
the DNA sequences provided herein or against exogenous sequence fused to PRO DNA and encoding a specific 

15 antibody epitope. 

5. Purification of Polypeptide 

Forms of PRO may be recovered from culture medium or from host cell lysates. If membrane-bound, 
it can be released from the membrane using a suitable detergent solution (e.g. Triton-X 100) or by enzymatic 

20 cleavage. Cells employed in expression of PRO can be disrupted by various physical or chemical means, such 
as freeze-thaw cycling, sonication, mechanical disruption, or cell lysing agents. 

It may be desired to purify PRO from recombinant cell proteins or polypeptides. The following 
procedures are exemplary of suitable purification procedures: by fractionation on an ion-exchange column; 
ethanol precipitation; reverse phase HPLC; chromatography on silica or on a cation-exchange resin such as 

25 DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, for example, 
Sephadex G-75; protein A Sepharose columns to remove contaminants such as IgG; and metal chelating columns 
to bind epitope-tagged forms of the PRO. Various methods of protein purification may be employed and such 
methods are known in the an and described for example in Deutscher, Methods in Enzvmologv. 182 (1990); 
Scopes, Protein Purification: Principles and Practice. Springer- Verlag, New York (1982). The purification 

30 step(s) selected will depend, for example, on the nature of the production process used and the particular PRO 
produced. 

E. Uses for PRO 

Nucleotide sequences (or their complement) encoding PRO have various applications in the art of 
35 molecular biology, including uses as hybridization probes, in chromosome and gene mapping and in the 
generation of anti-sense RNA and DNA . PRO nucleic acid will also be useful for the preparation of PRO 
polypeptides by the recombinant techniques described herein. 
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The full-length native sequence PRO gene, or portions thereof, may be used as hybridization probes 
for a cDNA library to isolate the full-length PRO cDNA or to isolate still other cDNAs (for instance, those 
encoding iiarm*ally-occurring variants of PRO or PRO from other species) which have a desired sequence identity 
to the native PRO sequence disclosed herein. Optionally, the length of the probes will be about 20 to about 50 
bases. The hybridization probes may be derived from at least partially novel regions of the full length native 
5 nucleotide sequence wherein those regions may be determined without undue experimentation or from genomic 
sequences including promoters, enhancer elements and introns of native sequence PRO. By way of example, 
a screening method will comprise isolating the coding region of the PRO gene using the known DNA sequence 
to synthesize a selected probe of about 40 bases. Hybridization probes may be labeled by a variety of labels, 
including radionucleotides such as "P or "S, or enzymatic labels such as alkaline phosphatase coupled to the 
1 0 probe via avidin/biotin coupling systems. Labeled probes having a sequence complementary to that of the PRO 
gene of the present invention can be used to screen libraries of human cDNA, genomic DNA or mRNA to 
determine which members of such libraries the probe hybridizes to. Hybridization techniques are described in 
further detail in the Examples below. 

Any EST sequences disclosed in the present application may similarly be employed as probes, using 
15 the methods disclosed herein. 

Other useful fragments of the PRO nucleic acids include antisense or sense oligonucleotides comprising 
a singe-stranded nucleic acid sequence (either RNA or DNA) capable of binding to target PRO mRNA (sense) 
or PRO DNA (antisense) sequences. Antisense or sense oligonucleotides, according to the present invention, 
comprise a fragmeni of the coding region of PRO DNA. Such a fragment generally comprises at least about 14 
20 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an antisense or a sense 
oligonucleotide, based upon a cDNA sequence encoding a given protein is described in, for example, Stein and 
Cohen ( Cancer Res. 48:2659, 1988) and van der Krol et al. ( BioTechniaues 6 :958. 1988). 

Binding of antisense or sense oligonucleotides to target nucleic acid sequences results in the formation 
of duplexes that block transcription or translation of the target sequence by one of several means, including 
25 enhanced degradation of the duplexes, premature termination of transcription or translation, or by other means. 
The antisense oligonucleotides thus may be used to block expression of PRO proteins. Antisense or sense 
oligonucleotides further comprise oligonucleotides having modified sugar-poosphodiester backbones (or other 
sugar linkages, such as those described in WO 91/06629) and wherein such sugar linkages are resistant to 
endogenous nucleases. Such oligonucleotides with resistant sugar linkages are stable in vivo (i.e., capable of 
30 resisting enzymatic degradation) but retain sequence specificity to be able to bind to target nucleotide sequences . 

Other examples of sense or antisense oligonucleotides include those oligonucleotides which are 
covalently linked to organic moieties, such as those described in WO 90/10048, and other moieties that increases 
affinity of the oligonucleotide for a target nucleic acid sequence, such as poly-(L-lysine). Further still, 
intercalating agents, such as ellipticine, and alkylating agents or metal complexes may be attached to sense or 
35 antisense oligonucleotides to modify binding specificities of the antisense or sense oligonucleotide for the target 
nucleotide sequence. 
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Amisense or sense oligonucleotides may be introduced into a cell containing the target nucleic acid 
sequence by any gene transfer method, including, for example, CaP0 4 -mediated DNA transfection, 
electroporation, or by using gene transfer vectors such as Epstein-Barr virus. In a preferred procedure, an 
amisense or sense oligonucleotide is inserted into a suitable retroviral vector. A cell containing the target nucleic 
acid sequence is contacted with the recombinant retroviral vector, either in vivo or ex vivo. Suitable retroviral 
vectors include, but are not limited to, those derived from the murine retrovirus M-MuLV, N2 (a retrovirus 
derived from M-MuLV), or the double copy vectors designated DCT5A, DCT5B and DCT5C (see WO 
90/13641). 

Sense or amisense oligonucleotides also may be introduced into a cell containing the target nucleotide 
sequence by formation of a conjugate with a ligand binding molecule, as described in WO 91/04753. Suitable 
ligand binding molecules include, but are not limited to, cell surface receptors, growth factors, other cytokines, 
or other ligands that bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does 
not substantially interfere with the ability of the ligand binding molecule to bind to its corresponding molecule 
or receptor, or block entry of the sense or amisense oligonucleotide or its conjugated version into the cell. 

Alternatively, a sense or an amisense oligonucleotide may be introduced into a cell containing the target 
nucleic acid sequence by formation of an oligonucleotide -lipid complex, as described in WO 90/10448. The 
sense or amisense oligonucleotide-lipid complex is preferably dissociated within the cell by an endogenous lipase. 

The probes may also be employed in PCR techniques to generate a pool of sequences for identification 
of closely related PRO coding sequences. 

Nucleotide sequences encoding a PRO can also be used to construct hybridization probes for mapping 
the gene which encodes that PRO and for the genetic analysis of individuals with genetic disorders. The 
nucleotide sequences provided herein may be mapped to a chromosome and specific regions of a chromosome 
using known techniques, such as in situ hybridization, linkage analysis against known chromosomal markers, 
and hybridization screening with libraries. 

When the coding sequences for PRO encode a protein which binds to another protein (example, where 
the PRO is a receptor), the PRO can be used in assays to identify the other proteins or molecules involved in the 
binding interaction. By such methods, inhibitors of the receptor/ligand binding interaction can be identified. 
Proteins involved in such binding interactions can also be used to screen for peptide or small molecule inhibitors 
or agonists of the binding interaction. Also, the receptor PRO can be used to isolate correlative ligand(s). 
Screening assays can be designed to find lead compounds that mimic the biological activity of a native PRO or 
a receptor for PRO. Such screening assays will include assays amenable to high -throughput screening of 
chemical libraries, making them particularly suitable for identifying small molecule drug candidates. Small 
molecules contemplated include synthetic organic or inorganic compounds. The assays can be performed in a 
variety of formats, including protein-protein binding assays, biochemical screening assays, immunoassays and 
cell based assays, which are well characterized in the an. 

Nucleic acids which encode PRO or its modified forms can also be used to generate either transgenic 
animals or "knock out" animals which, in rum, are useful in the development and screening of therapeutically 
useful reagents. A transgenic animal (e.g., a mouse or rat) is an animal having cells thai contain a transgene. 
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which transgenic was introduced into the animal or an ancestor of the animal at a prenatal, e.g., an embryonic 
stage. A transgene is a DNA which is integrated into the genome of a cell from which a transgenic animal 
develops. In one embodiment, cDNA encoding PRO can be used to clone genomic DNA encoding PRO in 
accordance with established techniques and the genomic sequences used to generate transgenic animals that 
contain ceils which express DNA encoding PRO. Methods for generating transgenic animals, particularly 
5 animals such as mice or rats, have become conventional in the art and are described, for example, in U.S. Patent 
Nos. 4,736,866 and 4,870,009. Typically, particular cells would be targeted for PRO transgene incorporation 
with tissue- specific enhancers. Transgenic animals that include a copy of a transgene encoding PRO introduced 
into the germ line of the animal at an embryonic stage can be used to examine the effect of increased expression 
of DNA encoding PRO. Such animals can be used as tester animals for reagents thought to confer protection 
10 from, for example, pathological conditions associated with its overexpression. In accordance with this facet of 
the invention, an animal is treated with the reagent and a reduced incidence of the pathological condition, 
compared to untreated animals bearing the transgene, would indicate a potential therapeutic intervention for the 
pathological condition. 

Alternatively, non-human homologues of PRO can be used to construct a PRO "knock out" animal 

15 which has a defective or altered gene encoding PRO as a result of homologous recombination between the 
endogenous gene encoding PRO and altered genomic DNA encoding PRO introduced into an embryonic stem 
cell of the animal. For example, cDNA encoding PRO can be used to clone genomic DNA encoding PRO in 
accordance with established techniques. A portion of the genomic DNA encoding PRO can be deleted or 
replaced with another gene, such as a gene encoding a selectable marker which can be used to monitor 

20 integration. Typically, several kilobases of unaltered flanking DNA (both at the 5' and 3' ends) are included 
in the vector [see e.g., Thomas and Capecchi, Cell, 51:503 (1987) for a description of homologous 
recombination vectors] . The vector is introduced into an embryonic stem cell line (e.g. , by electropo ration) and 
cells in which the introduced DNA has homologously recombined with the endogenous DNA are selected (see 
e.g., Li et al., Cell, 69:915 (1992)]. The selected cells are then injected into a blastocyst of an animal (e.g., 

25 a mouse or rat) to form aggregation chimeras [see e.g., Bradley, in Teratocarcinomas and Embryonic Stem 
Cells: A Practical Approach, E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 1 13-152]. A chimeric embryo can 
then be implanted into a suitable pseudopregnant female foster animal and the embryo brought to term to create 
a "knock out" animal. Progeny harboring the homologously recombined DNA in their germ cells can be 
identified by standard techniques and used to breed animals in which all cells of the animal contain the 

30 homologously recombined DNA. Knockout animals can be characterized for instance, for their ability to defend 
against certain pathological conditions and for their development of pathological conditions due to absence of 
the PRO polypeptide. 

Nucleic acid encoding the PRO polypeptides may also be used in gene therapy. In gene therapy 
applications, genes are introduced into cells in order to achieve in vivo synthesis of a therapeutically effective 
35 genetic product, for example for replacement of a defective gene. "Gene therapy" includes both conventional 
gene therapy where a lasting effect is achieved by a single treatment, and the administration of gene therapeutic 
agents, which involves the one time or repeated administration of a therapeutically effective DNA or mRNA. 
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Antisense RN As and DNAs can be used as therapeutic agents for blocking the expression of certain genes in 
vivo. It has already been shown that short antisense oligonucleotides can be imported into cells where they act 
as inhibitors, despite their low intracellular concentrations caused by their restricted uptake by the cell 
membrane. (Zamecnik el al. , Proc. Natl. Acad. Sci. USA 83:4143-4146 [1986]). The oligonucleotides can be 
modified to enhance their uptake, e.g. by substituting their negatively charged phosphodiester groups by 
5 uncharged groups. 

There are a variety of techniques available for introducing nucleic acids into viable cells. The 
techniques vary depending upon whether the nucleic acid is transferred into cultured cells in vitro, or in vivo in 
the cells of the intended host. Techniques suitable for the transfer of nucleic acid into mammalian cells in vitro 
include the use of liposomes, electroporation, microinjection, cell fusion. DEAE-dextran, the calcium phosphate 

10 precipitation method, etc. The cunently preferred in vivo gene transfer techniques include transfection with viral 
(typically retroviral) vectors and viral coat protein-liposome mediated transfection (Dzau et al., Trends in 
Biotechnology 11, 205-210 [1993]). In some situations it is desirable to provide the nucleic acid source with 
an apent that targets the target cells, such as an antibody specific for a cell surface membrane protein or the 
target cell, a ligand for a receptor on the target cell, etc. Where liposomes are employed, proteins which bind 

15 to a cell surface membrane protein associated with endocytosis may be used for targeting and/or to facilitate 
uptake, e.g. capsid proteins or fragments thereof tropic for a particular cell type, antibodies for proteins which 
undergo internalization in cycling, proteins that target intracellular localization and enhance intracellular half-life. 
The technique of receptor- mediated endocytosis is described, for example, by Wu et al., J. Biol. Chem. 262, 
4429-4432 (1987); and Wagner et al., Proc. Natl. Acad. Sci. USA 87, 3410-3414 (1990). For review of gene 

20 marking and gene therapy protocols see Anderson et al., Science 256, 808-813 (1992). 

The PRO polypeptides described herein may also be employed as molecular weight markers for protein 
electrophoresis purposes. 

The nucleic acid molecules encoding the PRO polypeptides or fragments thereof described herein are 
useful for chromosome identification. In this regard, there exists an ongoing need to identify new chromosome 

25 markers, since relatively few chromosome marking reagents, based upon actual sequence data are presently 
available. Each PRO nucleic acid molecule of the present invention can be used as a chromosome marker. 

The PRO polypeptides and nucleic acid molecules of the present invention may also be used for tissue 
typing, wherein the PRO polypeptides of the present invention may be differentially expressed in one tissue as 
compared to another. PRO nucleic acid molecules will find use for generating probes for PCR, Northern 

30 analysis. Southern analysis and Western analysis. 

The PRO polypeptides described herein may also be employed as therapeutic agents. The PRO 
polypeptides of the present invention can be formulated according to known methods to prepare pharmaceutically 
useful compositions, whereby the PRO product hereof is combined in admixture with a pharmaceutically 
acceptable carrier vehicle. Therapeutic formulations are prepared for storage by mixing the active ingredient 

35 having the desired degree of purity with optional physiologically acceptable carriers, excipients or stabilizers 
(Remington's Pharmaceutical Sciences 16th edition, Osol, A. Ed. (1980)), in the form of lyophilized 
formulations or aqueous solutions. Acceptable carriers, excipients or stabilizers are nontoxic to recipients at the 
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dosages and concentrations employed, and include buffers such as phosphate, citrate and other organic acids; 
antioxidants including ascorbic acid; low molecular weight (less than about 10 residues) polypeptides; proteins, 
such as serum albumin, gelatin or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone, amino 
acids such as glycine, glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides and other 
carbohydrates including glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as 
mannitol or sorbitol; salt-forming counterions such as sodium; and/or nonionic surfactants such as TWEEN™, 
PLURON1CS™ or PEG. 

The formulations to be used for in vivo adrninistration must be sterile. This is readily accomplished by 
filtration through sterile filtration membranes, prior to or following lyophilizaiion and reconstirution. 

Therapeutic compositions herein generally are placed into a container having a sterile access port, for 
example, an intravenous solution bag or vial having a stopper pierceable by a hypodermic injection needle. 

The route of administration is in accord with known methods, e.g. injection or infusion by intravenous, 
intraperitoneal, intracerebral, intramuscular, intraocular, intraarterial or intralesional routes, topical 
administration, or by sustained release systems. 

Dosages and desired drug concentrations of pharmaceutical compositions of the present invention may 
vary depending on the particular use envisioned. The determination of the appropriate dosage or route of 
administration is well within the skill of an ordinary physician. Animal experiments provide reliable guidance 
for the determination of effective doses for human therapy. Interspecies scaling of effective doses can be 
performed following the principles laid down by Mordenti, J. and Chappell, W. "The use of interspecies scaling 
in toxicokinetics " In Toxicokinetics and New Drug Development, Yacobi et ah, Eds., Pergamon Press, New 
York 1989, pp. 42-96. 

When in vivo administration of a PRO polypeptide or agonist or antagonist thereof is employed, norma] 
dosage amounts may vary from about 10 ng/kg to up to 100 mg/kg of mammal body weight or more per day, 
preferably about 1 /ig/kg/day to 10 mg/kg/day, depending upon the route of administration. Guidance as to 
particular dosages and methods of delivery is provided in the literature; see, for example, U.S. Pat. Nos. 
4,657,760; 5,206,344; or 5,225,212. It is anticipated that different formulations will be effective for different 
treatment compounds and different disorders, that administration targeting one organ or tissue, for example, may 
necessitate delivery in a manner different from that to another organ or tissue. 

Where sustained-release adrninistration of a PRO polypeptide is desired in a formulation with release 
characteristics suitable for the treatment of any disease or disorder requiring adrninistration of the PRO 
polypeptide, microencapsulation of the PRO polypeptide is contemplated. Microencapsulation of recombinant 
proteins for sustained release has been successfully performed with human growth hormone (rhGH), interferon- 
(rhlFN- ), interleukin-2, and MN rgp!20. Johnson et al., Nat. Med. . 2:795-799 (1996); Yasuda, Biomed. 
Iher, 27:1221-1223 (1993); Hora et al., Bio/Technolopv 8-7V>.7^ (1990); Cleland. "Design and Production 
of Single Immunization Vaccines Using Polylactide Polyglycolide Microsphere Systems," in Vaccine Design: 
The Subunit and Adjuvant Approach, Powell and Newman, eds, (Plenum Press: New York, 1995), pp. 439-462; 
WO 97/03692, WO 96/40072, WO 96/07399; and U.S. Pat. No. 5,654,010. 
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The sustained-release formulations of these proteins were developed using poly-lactic-coglycolic acid 
(PLGA) polymer due to its biocompatibiliry and wide range of biodegradable properties. The degradation 
products of PLGA, lactic and glycolic acids, can be cleared quickly within the human body. Moreover, me 
degradabiiity of this polymer can be adjusted from months to years depending on its molecular weight and 
composition. Lewis, -Controlled release of bioactive agents from lactide/glycolide polymer, ' in: M. Chasin and 
5 R. Langer (Eds.), Biodegradable Polymers ax Drug Delivery s y «rm« (M*rr*\ Dekker: New York, 1990), pp. 
Ml. 

This invention encompasses methods of screening compounds to identify those that mimic the PRO 
polypeptide (agonists) or prevent the effect of the PRO polypeptide (antagonists). Screening assays for 
antagonist drug candidates are designed to identify compounds that bind or complex with the PRO polypeptides 
10 encoded by the genes identified herein, or otherwise interfere with the interaction of the encoded polypeptides 
with other cellular proteins. Such screening assays will include assays amenable to high-throughput screening 
of chemical libraries, making them particularly suitable for identifying small molecule drug candidates. 

The assays can be performed in a variety of formats, including protein-protein binding assays, 
biochemical screening assays, immunoassays, and cell-based assays, which are well characterized in the art. 
15 All assays for antagonists are common in that they call for contacting the drug candidate with a PRO 

polypeptide encoded by a nucleic acid identified herein under conditions and for a time sufficient to allow these 
two components to interact. 

In binding assays, the interaction is binding and the complex formed can be isolated or detected in the 
reaction mixture. In a particular embodiment, the PRO polypeptide encoded by the gene identified herein or the 
20 drug candidate is immobilized on a solid phase, e.g., on a microliter plate, by covalent or non-covalent 
attachments. Non-covalent attachment generally is accomplished by coating the solid surface with a solution of 
the PRO polypeptide and drying. Alternatively, an immobilized antibody, e.g., a monoclonal antibody, specific 
for the PRO polypeptide to be immobilized can be used to anchor it to a solid surface. The assay is performed 
by adding the non- immobilized component, which may be labeled by a detectable label, to the immobilized 
25 component, e.g. , the coated surface containing the anchored component. When the reaction is complete, the 
non-reacted components are removed, e.g., by washing, and complexes anchored on the solid surface are 
detected. When the originally non-immobilized component carries a detectable label, the detection of label 
immobilized on the surface indicates that completing occurred. Where the originally non-immobilized 
component does not carry a label, completing can be detected, for example, by using a labeled antibody 
30 specifically binding the immobilized complex. 

If the candidate compound interacts with but does not bind to a particular PRO polypeptide encoded by 
a gene identified herein, its interaction with that polypeptide can be assayed by methods well known for detecting 
protein-protein interactions. Such assays include traditional approaches, such as, e.g., cross-linking, co- 
immunoprecipitation, and co-purification through gradients or chromatographic columns. In addition, protein- 
35 protein interactions can be monitored by using a yeast-based genetic system described by Fields and co-workers 
(Fields and Song, Nature (London), 340:245-246 (1989); Chien et al., Proc. Natl. Acad . Sri iwa 88: 9578- 
9582 (1991)) as disclosed by Chevray and Nathans. Proc. N«l. Ae«l Vi na no- MaQy 
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transcriptional activators, such as yeast GAL4, consist of two physically discrete modular domains, one acting 
as the DNA-binding domain, the other one functioning as the transcription-activation domain. The yeast 
expression system described in the foregoing publications (generally referred to as the "two-hybrid system") 
takes advantage of this property, and employs two hybrid proteins, one in which the target protein is fused to 
the DNA-binding domain of GAM, and another, in which candidate activating proteins are fused to the 
activation domain. The expression of a GALWacZ reporter gene under control of a GAL4-activated promoter 
depends on reconstitution of GAL4 activity via protein-protein interaction. Colonies containing interacting 
polypeptides are detected with a chromogenic substrate for P-galactosidase. A complete kit 
(MATCHMAKER™) for identifying protein -protein interactions between two specific proteins using the two- 
hybrid technique is commercialJy available from Clontech. This system can also be extended to map protein 
domains involved in specific protein interactions as well as to pinpoint amino acid residues that are crucial for 
these interactions. 

Compounds that interfere with the interaction of a gene encoding a PRO polypeptide identified herein 
and other intra- or extracellular components can be tested as follows: usually a reaction mixture is prepared 
containing the product of the gene and the intra- or extracellular component under conditions and for a time 
allowing for the interaction and binding of the two products. To test the ability of a candidate compound to 
inhibit binding, the reaction is run in the absence and in the presence of the test compound. In addition, a 
placebo may be added to a third reaction mixture, to serve as positive control. The binding (complex formation) 
between the test compound and the intra- or extracellular component present in the mixture is monitored as 
described hereinabove. The formation of a complex in the control reaction(s) but not in the reaction mixture 
containing the test compound indicates that the test compound interferes with the interaction of the test compound 
and its reaction partner. 

To assay for antagonists, the PRO polypeptide may be added to a cell along with the compound to be 
screened for a particular activity and the ability of the compound to inhibit the activity of interest in the presence 
of the PRO polypeptide indicates that the compound is an antagonist to the PRO polypeptide. Alternatively, 
antagonists may be detected by combining the PRO polypeptide and a potential antagonist with membrane-bound 
PRO polypeptide receptors or recombinant receptors under appropriate conditions for a competitive inhibition 
assay. The PRO polypeptide can be labeled, such as by radioactivity, such that the number of PRO polypeptide 
molecules bound to the receptor can be used to determine the effectiveness of the potential antagonist. The gene 
encodmg the receptor can be identified by numerous methods known to those of skill in the art, for example, 
Hgand panning and FACS sorting. Coligan et a!., Current Protocols in Immim i (2 ): Chapter 5 (1991). 
Preferably, expression cloning is employed wherein polyadenylated RNA is prepared from a cell responsive to 
the PRO polypeptide and a cDNA library created from this RNA is divided into pools and used to transfect COS 
cells or other cells that are not responsive to the PRO polypeptide. Transfected cells that are grown on glass 
slides are exposed to labeled PRO polypeptide. The PRO polypeptide can be labeled by a variety of means 
including iodination or inclusion of a recognition site for a site-specific protein kinase. Following fixation and 
incubation, the slides are subjected to autoradiographic analysis. Positive pools are identified and sub-pools are 
prepared and re-transfected using an interactive sub-pooling and re-screening process, eventually yielding a 
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single clone thai encodes ihe putative receptor. 

As an alternative approach for receptor identification, labeled PRO polypeptide can be photoaffinity- 
linked with cell membrane or extract preparations that express the receptor molecule. Cross-Iinked material is 
resolved by PAGE and exposed to X-ray film. The labeled complex containing the receptor can be excised, 
resolved into peptide fragments, and subjected to protein micro-sequencing. The amino acid sequence obtained 
5 from micro- sequencing would be used to design a set of degenerate oligonucleotide probes to screen a cDNA 
library to identify the gene encoding the putative receptor. 

In another assay for antagonists, mammalian cells or a membrane preparation expressing the receptor 
would be incubated with labeled PRO polypeptide in the presence of the candidate compound. The ability of 
the compound to enhance or block this interaction could then be measured. 
10 More specific examples of potential antagonists include an oligonucleotide that binds to the fusions of 

immunoglobulin with PRO polypeptide, and, in particular, antibodies including, without limitation, poly- and 
monoclonal antibodies and antibody fragments, singled antibodies, anti-idiorypic antibodies, and chimeric 
or humanized versions of such antibodies or fragments, as well as human antibodies and antibody fragments. 
Alternatively, a potential antagonist may be a closely related protein, for example, a mutated form of the PRO 
15 polypeptide mat recognizes the receptor but imparts no effect, thereby competitively inhibiting the action of the 
PRO polypeptide. 

Another potential PRO polypeptide antagonist is an amisense RNA or DNA construct prepared using 
amisense technology, where, e.g. . an amisense RNA or DNA molecule acts to block directly the translation of 
mRNA by hybridizing to targeted mRNA and preventing protein translation. Amisense technology can be used 
20 to control gene expression through triple-helix formation or amisense DNA or RNA. both of which methods are 
based on binding of a polynucleotide to DNA or RNA. For example, the 5 • coding portion of tie polynucleotide 
sequence, which encodes the mature PRO polypeptides herein, is used to design an antisense RNA 
oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide is designed to be 
complementary to a regton of the gene involved in transcription (triple helix - see Lee et al.. Nucl Acid. « 
25 6:3073 (1979); Cooney e, al.. Science, 241: 456 (1988); Dervan e, al., Science, 251:1360 (1991)), thereby 
preventmg transcription and the production of the PRO polypeptide. The antisense RNA oligonucleotide 
hybndizes to the mRNA ,n v,v 0 and blocks translation of the mRNA molecule into the PRO polypeptide 
(anttsense - Olcano. Neujocfe, 56:560 (1991); Qli E odeoxv n .,H f n, i H.. , Anti , rmf , nhihilm „, 
Expxessioa (CRC Press: Boca Raton. FL, 1988). The ol.gonucleo.ides described above can also be delivered 
30 to cells such that the antisense RNA or DNA may be expressed in Ww, , 0 inhibit production of the PRO 
polypeptide. WhenantisenseDNAis used, ol.godeoxyribonucleotides derived from the transla.ion-initiationsite 
e.g.. between about -10 and + 10 positions of the targe, gene nucleotide sequence, are preferred 

Potential antagonists include small molecules that bind to the active she. the receptor binding site or 
growth factor or other relevant binding sue of the PRO polypeptide, thereby blocking the norma, biological 
35 acttvUy of the PRO polypeptide. Examples of small molecules include, but are no. limited to. smaU peptides 
or pep.ide-.ike molecules, preferably so.ub.e peptides, and synthetic non-peptidyl organic or inorganic 
compounds. 
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Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA. 
Ribozymes act by sequence-specifichybridization to the complementary target RNA, followed by endonucleolytic 
cleavage. Specific ribozyme cleavage sites within a potential RNA target can be identified by known techniques. 
For further details see, e.g., Rossi, Current Biology. 4:469-471 (1994), and PCT publication No. WO 97/33551 
(published September 18, 1997). 

Nucleic acid molecules in triple-helix formation used to inhibit transcription should be single -stranded 
and composed of deoxynucleotides. The base composition of these oligonucleotides is designed such that it 
promotes triple-helix formation via Hoogsteen base-pairing rules, which generally require sizeable stretches of 
purines or pyrimidines on one strand of a duplex. For further details see, e.g., PCT publication No. WO 
97/33551, supra. 

These small molecules can be identified by any one or more of the screening assays discussed 
hereinabove and/or by any other screening techniques well known for those skilled in the art. 

PR0189 can be used in assays with W01A6.1 of C. Elegans, phosphodiesterases, transporters and 
proteins which bind to fatty acids, to determine the relative activities of PR0189 against these proteins. The 
results can be applied accordingly. 

F- Anti-PRO Antibodies 

The present invention further provides anti-PRO antibodies. Exemplary antibodies include polyclonal, 
monoclonal, humanized, bispecific, and heteroconjugate antibodies. 

1- Polyclonal Antibodies 
The anti-PRO antibodies may comprise polyclonal antibodies. Methods of preparing polyclonal 
antibodies are known to the skilled artisan. Polyclonal antibodies can be raised in a mammal, for example, by 
one or more injections of an immunizing agent and, if desired, an adjuvant. Typically, the immunizing agent 
and/or adjuvant will be injected in the mammal by multiple subcutaneous or intraperitoneal injections. The 
immunizing agent may include the PRO polypeptide or a fusion protein thereof. It may be useful to conjugate 
the immunizing agent to a protein known to be immunogenic in the mammal being immunized. Examples of 
such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, bovine 
thyroglobulin, and soybean trypsin inhibitor. Examples of adjuvants which may be employed include Freund's 
complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). 
The immunization protocol may be selected by one skilled in the an without undue experimentation. 

2. Monoclonal Antibodies 
The anti-PRO antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies may be 
prepared using hybridoma methods, such as those described by Kohler and Milstein, Nature . 256:495 (1975). 
In a hybridoma method, a mouse, hamster, or other appropriate host animal, is typically immunized with an 
immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that wilt specifically 
bind to the immunizmg agent. Alternatively, the lymphocytes may be immunized in vitro. 
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The immunizing agent will typically include the PRO polypeptide or a fusion protein thereof. 
Generally, either peripheral blood lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen 
cells or lymph node cells are used if non-human mammalian sources are desired. The lymphocytes arc then 
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a 
hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice. Academic Press, (1986) pp. 59-103]. 
Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine 
and human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells may be 
cultured in a suitable culture medium that preferably contains one or more substances that inhibit the growth or 
survival of the unfused, immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine 
guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will 
include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the growth of 
HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression of 
antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. More 
preferred immortalized cell lines are murine myeloma lines, which can be obtained, for instance, from the Salk 
Institute Cell Distribution Center, San Diego, California and the American Type Culture Collection, Manassas, 
Virginia. Human myeloma and mouse-human heteromyeloma cell lines also have been described for the 
production of human monoclonal antibodies [Kozbor, J. Immunol. . J33:3001 (1984); Brodeur et al. , Monoclonal 
Antibody Production Techniques and Applications. Marcel Dekker, Inc., New York, (1987) pp. 5 1 -63 J . 

The culture medium in which the hybridoma cells are cultured can then be assayed for the presence of 
monoclonal antibodies directed against PRO. Preferably, the binding specificity of monoclonal antibodies 
produced by the hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as 
radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELJSA). Such techniques and assays are 
known in the an. The binding affinity of the monoclonal antibody can, for example, be determined by the 
Scatchard analysis of Munson and Pollard, Anal. Biochem.. 107:220 (1980). 

After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution 
procedures and grown by standard methods rGoding, supra]. Suitable culture media for this purpose include, 
for example, Dulbecco's Modified Eagle's Medium and RPMI-16^0 medium. Alternatively, the hybridoma cells 
may be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones may be isolated or purified from the culture 
medium or ascites fluid by conventional immunoglobulin purification procedures such as, for example, protein 
A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies may also be made by recombinant DNA methods, such as those described 
in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the invention can be readily isolated 
and sequenced using conventional procedures (e.g., by using oligonucleotide probes that are capable of binding 
specifically to genes encoding the heavy and light chains of murine antibodies). The hybridoma cells of the 
invention serve as a preferred source of such DNA. Once isolated, the DNA may be placed into expression 
vectors, which are then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, 
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or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal 
antibodies in the recombinant host cells. The DNA also may be modified, for example, by substituting the 
coding sequence for human heavy and light chain constant domains in place of the homologous murine sequences 
fU.S. Patent No. 4,816,567; Morrison et al., supra! or by covalemly joining to the immunoglobulin coding 
sequence all or part of the coding sequence for a non-immunoglobulin polypeptide. Such a non- immunoglobulin 
polypeptide can be substituted for the constant domains of an antibody of the invention, or can be substituted for 
the variable domains of one anugen^ombining site of an antibody of the invention to create a chimeric bivalent 
antibody. 

The antibodies may be monovalent antibodies. Methods for preparing monovalent antibodies are well 
known in the art. For example, one method involves recombinant expression of immunoglobulin light chain and 
modified heavy chain. The heavy chain is truncated generally at any point in the Fc region so as to prevent 
heavy chain crosslinking. Alternatively, the relevant cysteine residues are substituted with another amino acid 
residue or are deleted so as to prevent crosslinking. 

In vitro methods are also suitable for preparing monovalent antibodies. Digestion of antibodies to 
produce fragments thereof, particularly, Fab fragments, can be accomplished using routine techniques known 
in the art. 

3. Human and Humanized Antibodies 
The anti-PRO antibodies of the invention may further comprise humanized antibodies or human 
antibodies. Humanized forms of non-human (e.g., murine) antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab*. F(ab')2 or other antigen-binding 
subsequences of antibodies) which contain minima] sequence derived from non-human immunoglobulin. 
Humanized antibodies include human immunoglobulins (recipient antibody) in which residues from a 
complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a non-human 
species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity. In 
some instances, Fv framework residues of the human immunoglobulin are replaced by corresponding non-human 
residues. Humanized antibodies may also comprise residues which are found neither in the recipient antibody 
nor in the imported CDR or framework sequences. In general, the humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which ail or substantially all of the CDR 
regions correspond to those of a non-human immunoglobulin and all or substantially all of the FR regions are 
those of a human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 
at least a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin [Jones 
etal., Nature, 321:522-525 (1986); Riechmannetal., Nature, 332:323-329(1988); and Presta, Curr. On. Struct. 
Biol. . 2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well known in the art. Generally, a humanized 
antibody has one or more amino acid residues introduced into it from a source which is non-human. These non- 
human amino acid residues are often referred to as "import" residues, which are typically taken from an "import" 
variable domain. Humanization can be essentially performed following the method of Winter and co-workers 
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[Jones ct al., Nature , 221:522-525 (1986); Riechmann et al.. Nature . 332:323-327 (1988); Verhoeyen et al., 
Science , 239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the corresponding 
sequences of a human antibody. Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Patent 
No. 4,816,567), wherein substantially less than an intact human variable domain has been substituted by the 
corresponding sequence from a non-human species. In practice, humanized antibodies axe typically human 
5 antibodies in which some CDR residues and possibly some FR residues are substituted by residues from 
analogous sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the an, including phage 
display libraries [Hoogenboom and Winter, J. Mol. Biol.. 227:381 (1991); Marks et al., J. Mol. Biol.. 222:581 
(1991)]. The techniques of Cole et al. and Boerner et al. are also available for the preparation of human 

10 monoclonal antibodies (Cole et al., Monoclonal Antibodies and Cancer Therapy. Alan R. Liss, p. 77 (1985) and 
Boerner et al., J. Immunol.. 147(l) :86-95 (1991)]. Similarly, human antibodies can be made by introducing 
of human immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous immunoglobulin 
genes have been partially or completely inactivated. Upon challenge, human antibody production is observed, 
which closely resembles that seen in humans in all respects, including gene rearrangement, assembly, and 

15 antibody repertoire. This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 
5,569,825; 5,625.126; 5,633,425; 5,661,016, and in the following scientific publications: Marks et al., 
Bio/Technology 10. 779-783 (1992); Lonberg etal.. Nature 368 856-859(1994): Morrison, Nature 368 . 812-13 
(1994); Fishwild et al., Nature Biotechnology 14. 845-51 (19%); Neuberger, Nature Biotechnology 14. 826 
(1996); Lonberg and Huszar, Intern. Rev. Immunol. 13 65-93 (1995). 

20 

4. Bispecific Antibodies 
Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens. In the present case, one of the binding specificities is for the 
PRO, the other one is for any other antigen, and preferably for a cell-surface protein or receptor or receptor 
25 subunit. 

Methods for making bispecific antibodies are known in the an. Traditionally, the recombinant 
production of bispecific antibodies is based on the co-expression of two immunoglobulin heavy-chain/light-chain 
pairs, where the two heavy chains have different specificities [Milstein and Cuello, Nature . 305:537-539 (1983)]. 
Because of the random assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) 
30 produce a potential mixture of ten different antibody molecules, of which only one has the correct bispecific 
structure. The purification of the correct molecule is usually accomplished by affinity chromatography steps. 
Similar procedures are disclosed in WO 93/08829, published 13 May 1993, and in Traunecker et al., EMBO 
L. 10:3655-3659 (1991). 

Antibody variable domains with the desired binding specificities (antibody-antigen combining sites) can 
35 be fused to immunoglobulin constant domain sequences. The fusion preferably is with an immunoglobulin 
heavy-chain constant domain, comprising at least pan of the hinge, CH2, and CH3 regions. It is preferred to 
have the first heavy -chain constant region (CH 1) containing the site necessary for light-chain binding present in 
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ai least one of the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the 
immunoglobulin light chain, are inserted into separate expression vectors, and are co-transfected into a suitable 
host organism. For further details of generating bispecific antibodies see, for example, Suresh et al.. Methods 
in Enzymologv, 121:210(1986). 

According to another approach described in WO 96/27011, the interface between a pair of antibody 
5 molecules can be engineered to maximize the percentage of heterodimers which are recovered from recombinant 
cell culture. The preferred interface comprises at least a pan of the CH3 region of an antibody constant domain. 
In this method, one or more small amino acid side chains from the interface of the first antibody molecule are 
replaced with larger side chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or similar 
size to the large side chain(s) are created on the interface of the second antibody molecule by replacing large 
10 a m i n o acid side chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for increasing 
the yield of the heterodimer over other unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. F(ab')j 
bispecific antibodies). Techniques for generating bispecific antibodies from antibody fragments have been 
described in the literature. For example, bispecific antibodies can be prepared can be prepared using chemical 
15 linkage. Brennan et al. , Scjence 229:81 (1985) describe a procedure wherein intact antibodies are proteolytically 
cleaved to generate F(ab') 2 fragments. These fragments are reduced in the presence of the dithiol complexing 
agent sodium arsenite to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' 
fragments generated are then converted to thioniirobenzoate (TNB) derivatives. One of the Fab*-TNB 
derivatives is then reconvened to the Fab'-thiol by reduction with mercaptoethylamine and is mixed with an 
20 equimolar amount of the other Fab' -TNB derivative to form the bispecific antibody. The bispecific antibodies 
produced can be used as agents for the selective immobilization of enzymes. 

Fab' fragments may be directly recovered from E. coli and chemically coupled to form bispecific 
antibodies. Shalaby et al, L Exp. Med. 175:217-225 (1992) describe the production of a fully humanized 
bispecific antibody F(ab') : molecule. Each Fab' fragment was separately secreted from E coli and subjected 
25 to directed chemical coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was 
able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic 
activity of human cytotoxic lymphocytes against human breast tumor targets. 

Various technique for making and isolating bispecific antibody fragments directly from recombinant cell 
culture have also been described. For example, bispecific antibodies have been produced using leucine zippers. 
30 Kostelny et aL % J. Immunol. 148(5): 1547-1553 (1992). The leucine zipper peptides from the Fos and Jun 
proteins were linked to the Fab' portions of two different antibodies by gene fusion. The antibody homodimers 
were reduced at the hinge region to form monomers and then re-oxidized to form the antibody heterodimers. 
This method can also be utilized for the production of antibody homodimers. The "diabody" technology 
described by Hollinger et */., Proc. Natl. Acad Sri 1TSA 90:6444-6448 (1993) has provided an alternative 
35 mechanism for making bispecific antibody fragments. The fragments comprise a heavy-chain variable domain 
(V„) connected to a light-chain variable domain (VJ by a linker which is too short to allow pairing between the 
two domains on the same chain. Accordingly, the V H and V L domains of one fragment are forced to pair with 
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the complementary V L and V H domains of another fragment, thereby forming two antigen-binding sites. Another 
strategy for making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has aJso been 
reported. See, Gruber et al , J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific antibodies can be prepared. 
Tun et a/., J. Immunol. 147:60 (1991). 
5 Exemplary bispecific antibodies may bind to two different epitopes on a given PRO polypeptide herein. 

Alternatively, an anti-PRO polypeptide arm may be combined with an arm which binds to a triggering molecule 
on a leukocyte such as a T-ceU receptor molecule (e.g. CD2, CD3, CD28, or B7), or Fc receptors for IgG 
(FcyR), such as FcvRI (CD64), FcyRIl (CD32) and FcyRIII (CD 16) so as to focus cellular defense mechanisms 
to the cell expressing the particular PRO polypeptide. Bispecific antibodies may also be used to localize 
10 cytotoxic agents to cells which express a particular PRO polypeptide. These antibodies possess a PRO-binding 
arm and an arm which binds a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DFTA, DOTA, 
or TETA. Another bispecific antibody of interest binds the PRO polypeptide and further binds tissue factor 
(TF). 

15 5. Heterocoiijugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate 
antibodies are composed of two covalently joined antibodies. Such antibodies have, for example, been proposed 
to target immune system cells to unwanted cells [U.S. Patent No. 4,676,980], and for treatment of HIV infection 
[WO 91/00360; WO 92/200373; EP 03089]. It is contemplated that the antibodies may be prepared in vitro 

20 using known methods in synthetic protein chemistry, including those involving crosslinking agents. For 
example, inununotoxins may be constructed using a disulfide exchange reaction or by forming a thioether bond. 
Examples of suitable reagents for this purpose include irninothiolate and methyl -4-mercaptobutyrimidate and 
those disclosed, for example, in U.S. Patent No. 4,676,980. 

25 6. Effector Function Engineering 

It may be desirable to modify the antibody of the invention with respect to effector function, so as to 
enhance, e.g., the effectiveness of the antibody in treating cancer. For example, cysteine residue(s) may be 
introduced into the Fc region, thereby allowing interchain disulfide bond formation in this region. The 
homodimeric antibody thus generated may have improved internalization capability and /or increased 

30 complement-mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et a/., £. 
Ex^Med., 176: 1191-1195(1992) and Shopcs, J. Immunol .. 148:2918-2922(1992). Homodimeric antibodies 
with enhanced anti-tumor activity may also be prepared using beterobifuncuonal cross-linkers as described in 
Wolff etal. Cancer Research. 53: 2560-2565 (1993). Alternatively, an antibody can be engineered that has dual 
Fc regions and may thereby have enhanced complement lysis and ADCC capabilities. See Stevenson et al. t Anti- 

35 Cancer Drug Design. 3: 219-230 (1989). 
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7. lmmunoconiugates 
The invention also pertains to immunoconjugates comprising an antibody conjugated to a cytotoxic ageni 
such as a chemothcrapemic agent, toxin (e.g. , an enzymaticaHy active toxin of bacterial, fungal, plant, or animal 
origin, or fragments thereof), or a radioactive isotope (i.e., a radioconjugate) . 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been described above. 
5 Enzymatically active toxins and fragments thereof that can be used include diphtheria A chain, nonbinding active 
fragments of diphtheria toxin, exotoxin A chain (from Pseudomonns aeruginosa), ricin A chain, abrin A chain, 
modeccin A chain, alpha-sarcin, AUuriies fordii proteins, dianihin proteins, Phytolaca americana proteins 
(PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, 
gelonin, mitogellin, restrictocin, phenoraycin, enomycin, and the tricothecenes. A variety of radionuclides are 
10 available for the production of radioconjugated antibodies. Examples include 2II Bi, I31 I, l3l ln, and '"Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional protein-coupling 
agents such as N-succmimidyl-3-(2-pyridyldithiol) propionate (SPDP), iminoihiolane (IT), bifunctional 
derivatives of imidoesters (such as dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), 
aldehydes (such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hcxanediamine), bis- 
15 diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6- 
diisocyanate), and bis-active fluorine compounds (such as 1 ,5-dinuoro-2,4-dinitrobenzene). For example, a ricin 
immunotoxin can be prepared as described in Vitetta et al., Science , 238 : 1098 (1987). Carbon- 14-labeled 1- 
isothiocyanatobenzyl-3-methyIdiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent 
for conjugation of radionucleoude to the antibody. See W094/1 1026. 
20 In another embodiment, the anubody may be conjugated to a "receptor" (such streptavidin) for 

utilization in rumor pretargeting wherein the antibody-receptor conjugate is administered to the patient, followed 
by removal of unbound conjugate from the circulation using a clearing agent and then administration of a 
"ligand" (e.g., avidin) that is conjugated to a cytotoxic agent (e.g., a radionucieotide). 

25 8. Immunoliposomcs 

The antibodies disclosed herein may also be formulated as immunoliposomes. Liposomes containing 

the antibody are prepared by methods known in the art, such as described in Epstein et al., Proc. Natl. Acad. 

Sci. USA . 82: 3688 (1985); Hwang et al., Proc. Natl Acad. Sci. USA . 77: 4030 (1980); and U.S. Pat. Nos. 

4,485,045 and 4,544,545. Liposomes with enhanced circulation time are disclosed in U.S. Patent No. 
30 5,013,556. 

Particularly useful liposomes can be generated by the reverse-phase evaporation method with a lipid 
composition comprising phosphatidylcholine, cholesterol, and PEG-derivatized phosphatidylethanolamine (PEG- 
PE). Liposomes are extruded through filters of defined pore size to yield liposomes with the desired diameter. 
Fab' fragments of the antibody of the present invention can be conjugated to the liposomes as described in Martin 
35 et al .. J. Biol. Chem, . 257 : 286-288 (1982) via a disulfide-intercbange reaction. A chemotherapeutic agent 
(such as Doxorubicin) is optionally contained within the liposome. See Gabizon etal , J. National Cancer Inst.. 
81(19): 1484 (1989). 
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9. Pharmaceutical Compositions of Antibodies 
Antibodies specifically binding a PRO polypeptide identifiedherem, as well as other molecules identified 
by the screening assays disclosed hereinbefore, can be administered for the treatment of various disorders in the 
form of pharmaceutical compositions. 

If the PRO polypeptide is intracellular and whole antibodies are used as inhibitors, internalizing 
5 antibodies are preferred. However, lipofections or liposomes can also be used to deliver the antibody, or an 
antibody fragment, into cells. Where antibody fragments are used, the smallest inhibitory fragment that 
specifically binds to the binding domain of the target protein is preferred. For example, based upon the variable- 
region sequences of an antibody, peptide molecules can be designed that retain the ability to bind the target 
protein sequence. Such peptides can be synthesized chemically and/or produced by recombinant DNA 
10 technology. See, e.g., Marasco et al., Proc. Natl. Acad. Sci. USA. 90: 7889-7893 (1993). The fcrrrirtation 
herein may also contain more than one active compound as necessary for the particular indication being treated, 
preferably those with complementary activities that do not adversely affect each other. Alternatively, or in 
addition, the composition may comprise an agent that enhances its function, such as, for example, a cytotoxic 
agent, cytokine, chemotherapeutic agent, or growth-inhibitory agent. Such molecules are suitably present in 
15 combination in amounts that are effective for the purpose intended. 

The active ingredients may also be entrapped in microcapsules prepared, for example, by coacervation 
techniques or by interfacial polymerization, for example, hydroxymethylcellulose or gelatin-microcapsules and 
poly-(methylmethacylate) microcapsules, respectively, in colloidal drug delivery systems (for example, 
liposomes, albumin microspheres, microemulsions, nano-panicles, and nanocapsules) or in macroemulsions. 
20 Such techniques are disclosed in Remington's Pharmaceutical Sciences , supra. 

The formulations to be used for in vivo adrninistration must be sterile. This is readily accomplished by 
filtration through sterile filtration membranes. 

Sustained-release preparations may be prepared. Suitable examples of sustained-release preparations 
include semipermeable matrices of solid hydrophobic polymers containing the antibody, which matrices are in 
25 the form of shaped articles, e.g., films, or microcapsules. Examples of sustained-release matrices include 
polyesters, hydrogels (for example, poly(2-hydroxyethyl-methacrylate), or poly(vinylalcohol)), polylactides 
(U.S. Pat. No. 3,773,919), copolymers of L-glutamic acid and y ethyl-L-glutamate, non-degradable ethylene- 
vinyl acetate, degradable lactic acid-glycolic acid copolymers such as the LUPRON DEPOT ™ (injectable 
microspheres composed of lactic acid-glycolic acid copolymer and leuprolide acetate), and poly-D-(-)-3- 
30 hydroxybutyric acid. While polymers such as ethylene -vinyl acetate and lactic acid-glycolic acid enable release 
of molecules for over 100 days, certain hydrogels release proteins for shorter time periods. When encapsulated 
antibodies remain in the body fora long time, they may denature or aggregate as a result of exposure to moisture 
at 37 °C, resulting in a loss of biological activity and possible changes in immunogeniciry. Rational strategies 
can be devised for stabilization depending on the mechanism involved. For example, if the aggregation 
35 mechanism is discovered to be intermolecularS-S bond formation through thio-disulfide interchange, stabilization 
may be achieved by modifying sulfhydryl residues, lyophilizing from acidic solutions, controlling moisture 
content, using appropriate additives, and developing specific polymer matrix compositions. 
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G. Uses for anti-PRO Antibodies 

The anti-PRO antibodies of the invention have various utilities. For example, anti-PRO antibodies may 
be used in diagnostic assays for PRO, e.g. , detecting its expression in specific cells, tissues, or serum. Various 
diagnostic assay techniques known in the an may be used, such as competitive binding assays, direct or indirect 
sandwich assays and immunoprecipitation assays conducted in either heterogeneous or homogeneous phases 
[Zola, Monoclonal Antibodies: A Manual of Techniques. CRC Press, Inc. (1987) pp. 147-158]. The antibodies 
used in the diagnostic assays can be labeled with a detectable moiety. The detectable moiety should be capable 
of producing, either direcdy or indirectly, a delectable signal. For example, the detectable moiety may be a 
radioisotope, such as 3 H, 14 C, B P, "S, or I23 I, a fluorescent or cherniluminescent compound, such as fluorescein 
isothiocyanate, rhc«iamine, or luciferin, or an enzyme, such as alkaline phosphatase, beta-galactosidase or 
horseradish peroxidase. Any method known in the art for conjugating the antibody to the detectable moiety may 
be employed, including those methods described by Hunter et al.. Nature. 144:945 (1962); David et al., 
Biochemistry. 13:1014 (1974); Pain et ah, J. Immunol. Meth.. 40:219(1981); and Nygren, J. Histochcm. and 
Cvtochem.. 20:407 (1982). 

Anti-PRO antibodies also are useful for the affinity purification of PRO from recombinant cell culture 
or natural sources. In this process, the antibodies against PRO are immobilized on a suitable support, such a 
Sephadex resin or friter paper, using methods well known in the art. The immobilized antibody then is contacted 
with a sample containing the PRO to be purified, and thereafter the support is washed with a suitable solvent that 
will remove substantially all the material in the sample except the PRO, which is bound to the immobilized 
antibody. Finally, the support is washed with another suitable solvent that will release the PRO from the 
antibody. 

The following examples are offered for illustrative purposes only, and are not intended to limit the scope 
of the present invention in any way. 

All patent and literature references cited in the present specification are hereby incorporated by reference 
in their entirety. 

EXAMPLES 

Commercially available reagents referred to in the examples were used according to manufacrurer's 
instructions unless otherwise indicated. The source of those cells identified in the following examples, and 
throughout the specification, by ATCC accession numbers is the American Type Culture Collection, Manassas, 
VA. 

EXAMPLE 1: Extracellular Domain Ho mology Screening to Identify Novel Polypeptides and cDNA Encoding 
Therefor 

The extracellular domain (ECD) sequences (including the secretion signal sequence, if any) from about 
950 known secreted proteins from the Swiss-Prot public database were used to search EST databases. The EST 
databases included public databases (e.g.. Dayhoff, GenBank), and proprietary databases (e.g. LIFESEQ™, 
Incyte Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program WU-BLAST-2 
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(Altscbul et al., Methods in Enzvmologv 266:460-480 (1996)) as a comparison of the ECD protein sequences 
to a 6 frame translation of the EST sequences. Those comparisons with a Blast score of 70 (or in some cases 
90) or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences 
with the program "phrap' (Phil Green, University of Washington, Seattle, WA). 

Using this extracellular domain homology screen, consensus DNA sequences were assembled relative 
to the other identified EST sequences using phrap. In addition, the consensus DNA sequences obtained were 
often (but not always) extended using repeated cycles of WU-BLAST-2 and phrap to exiend the consensus 
sequence as far as possible using the sources of EST sequences discussed above. 

Based upon the consensus sequences obtained as described above, oligonucleotides were then 
synthesized and used to identify by PCR a cDNA library that contained the sequence of interest and for use as 
probes to isolate a clone of the full-length coding sequence for a PRO polypeptide. Forward and reverse PCR 
primers generally range from 20 to 30 nucleotides and are often designed to give a PCR product of about 100- 
1000 bp in length. The probe sequences are typically 40-55 bp in length. In some cases, additional 
oligonucleotides are synthesized when the consensus sequence is greater than about 1-1 .5kbp. In order to screen 
several libraries for a full-length clone, DNA from the libraries was screened by PCR amplification, as per 
Ausubel et al., Current Protocols in Molecular Biolosy T with the PCR primer pair. A positive library was then 
used to isolate clones encoding the gene of interest using the probe oligonucleotide and one of the primer pairs. 

Hie cDNA libraries used to isolate the cDNA clones were constructed by standard methods using 
commercially available reagents such as those from lnvitrogen, San Diego, CA. The cDNA was primed with 
oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with Nod, sized 
appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as 
pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al. , Science . 
253:1278-1280 (1991)) in the unique Xhol and NotI sites. 

EXAMPLE 2 : Isolation of cDNA clones bv Amylase Screenin g 
1- Preparation of oligo dT primed cDNA library 

mRN A was isolated from a human tissue of interest using reagents and protocols from lnvitrogen, San 
Diego, CA (Fast Track 2). This RNA was used to generate an oligo dT primed cDNA library in the vector 
pRK5D using reagents and protocols from Life Technologies, Gaithersburg, MD (Super Script Plasmid System). 
In this procedure, the double stranded cDNA was sized to greater than 1000 bp and the Sall/NotI tinkered cDNA 
was cloned into XhoI/NotI cleaved vector. pRK5D is a cloning vector that has an sp6 transcription initiation 
she followed by an Sfil restriction enzyme site preceding the XhoI/NotI cDNA cloning sites. 

2. Preparation of random primed cDNA library 

A secondary cDNA library was generated in order to preferentially represent the 5' ends of the primary 
cDNA clones. Sp6 RNA was generated from the primary library (described above), and this RNA was used to 
generate a random primed cDNA library in the vector pSST-AMY.O using reagents and protocols from Life 
Technologies (Super Script Plasmid System, referenced above). In this procedure the double stranded cDNA 
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was sized to 500-1000 bp, linkered with blunt to NoU adaptors, cleaved with SRI, and cloned into Sfil/NotI 
cleaved vector. pSST-AMY.O is a cloning vector that has a yeast alcohol dehydrogenase promoter preceding 
the cDNA cloning sites and the mouse amylase sequence (the mature sequence without the secretion signal) 
followed by the yeast alcohol dehydrogenase terminator, after the cloning sites. Thus, cDNAs cloned into this 
vector that are fused in frame with amylase sequence will lead to the secretion of amylase from appropriately 
transfected yeast colonies. 

3. Transformation and Detection 

DNA from the library described in paragraph 2 above was chilled on ice to which was added 
electrocompetent DH10B bacteria (Life Technologies, 20 ml). The bacteria and vector mixture was then 
electroporated as recommended by the manufacturer. Subsequently, SOC media (Life Technologies, 1 ml) was 
added and the mixture was incubated at 37 °C for 30 minutes. The transformants were then plated onto 20 
standard 150 mm LB plates containing ampicillin and incubated for 16 hours (37*0). Positive colonies were 
scraped off the plates and the DNA was isolated from the bacterial pellet using standard protocols, e.g. CsCl- 
gradient. The purified DNA was then carried on to the yeast protocols below. 

The yeast methods were divided into three categories; (1) Transformation of yeast with the 
plasmid/cDNA combined vector; (2) Detection and isolation of yeast clones secreting amylase; and (3) PCR 
amplification of the insert directly from the yeast colony and purification of the DNA for sequencing and further 
analysis. 

The yeast strain used was HD56-5A (ATCC-90785). This strain has the following genotype: MAT 
alpha, ura3-52, leu2-3, leu2-112, his3-ll, his3-15, MAL*, SUC\ GAL\ Preferably, yeast mutants can be 
employed that have deficient post-translational pathways. Such mutants may have translocation deficient alleles 
in seel I, sec!2, sec62 t with truncated secll being most preferred. Alternatively, antagonists (including 
antisense nucleotides and/or ligands) which interfere with the normal operation of these genes, other proteins 
implicated in this post translation pathway (e.g., SEC61p, SEC72p, SEC62p, SEC63p, TDJlp or SSAlp-4p) 
or the complex formation of these proteins may also be preferably employed in combination with the amylase- 
expressing yeast. 

Transformation was performed based on the protocol outlined by Gietz et al. , Nucl. Acid. Res.. 20: 1425 
(1992). Transformed cells were then inoculated from agar into YEPD complex media broth (100 ml) and grown 
overnight at 30°C. The YEPD broth was prepared as described in Kaiser et al., Methods in Yeast Genetics . 
Cold Spring Harbor Press, Cold Spring Harbor, NY, p. 207 (1994). The overnight culture was then diluted to 
about 2 x 10 6 cells/ml (approx. OD«o=0.1) into fresh YEPD broth (500 ml) and regrown to 1 x 10 7 cells/ml 
(approx. 00600=0.4-0.5). 

The cells were then harvested and prepared for transformation by transfer into GS3 rotor bottles in a 
Sorval GS3 rotor at 5,000 rpm for 5 minutes, the supernatant discarded, and then resuspended into sterile water, 
and centrifuged again in 50 ml falcon tubes at 3,500 rpm in a Beckraan GS-6XR centrifuge. The supernatant 
was discarded and the cells were subsequently washed with LiAc/TE (10 ml, 10 mM Tris-HCl, 1 mM EDTA 
pH 7.5. 100 mM Li : OOCCH 3 ), and resuspended into LiAc/TE (2.5 ml). 
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Transformation took place by mixing the prepared cells (100 jd) with freshly denatured single stranded 
salmon testes DNA (Lofstrand Labs, Gaithersburg, MD) and transforming DNA (1 pg t vol. < 10 pi) in 
microfuge tubes. The mixture was mixed briefly by vortexing, then 40% PEG/TE (600 ^1, 40% polyethylene 
glycol-4000, 10 mM Tris-HCl, 1 mM EDTA, 100 mM Li 3 OOCCH 3 , pH 7.5) was added. This mixmre was 
gently mixed and incubated at 30°C while agitating for 30 minutes. The cells were then heat shocked at 42 °C 
for 15 minutes, and the reaction vessel centrifuged in a microfuge at 12,000 rpm for 5-10 seconds, decanted and 
resuspended into TE (500 pi, 10 mM Tris-HCl, 1 mM EDTA pH 7.5) followed by recentrifugation. The cells 
were then diluted into TE (1 ml) and aliquots (200 /d) were spread onto the selective media previously prepared 
in 150 mm growth plates (VWR). 

Alternatively, instead of multiple small reactions, the transformation was performed using a single, large 
scale reaction, wherein reagent amounts were scaled up accordingly. 

The selective media used was a synthetic complete dextrose agar lacking uracil (SCD-Ura) prepared as 
described in Kaiser et al., Methods in Yeast Genetics. Cold Spring Harbor Press, Cold Spring Harbor, NY, p. 
208-210 (1994). Transformants were grown at 30°C for 2-3 days. 

The detection of colonies secreting amylase was performed by including red starch in the selective 
growth media. Starch was coupled to the red dye (Reactive Red- 120, Sigma) as per the procedure described by 
Biery et al., Anal. Biochem.. 172:176-179 (1988). The coupled starch was incorporated into the SCD-Ura agar 
plates at a final concentration of 0. 15% (w/v), and was buffered with potassium phosphate to a pH of 7.0 (50- 
100 mM final concentration). 

The positive colonies were picked and streaked across fresh selective media (onto 150 mm plates) in 
order to obtain well isolated and identifiable single colonies. Well isolated single colonies positive for amylase 
secretion were detected by direct incorporation of red starch into buffered SCD-Ura agar. Positive colonies were 
determined by their ability to break down starch resulting in a clear halo around the positive colony visualized 
directly. 

4. Isolation of DNA by PCR Amplification 

When a positive colony was isolated, a portion of it was picked by a toothpick and diluted into sterile 
water (30 pi) in a 96 well plate. At this time, the positive colonies were either frozen and stored for subsequent 
analysis or immediately amplified. An aliquot of cells (5 pi) was used as a template for the PCR reaction in a 
25 pi volume containing: 0.5 pi Klentaq (Clontech, Palo Alto, CA); 4.0 pi 10 mM dNTP's (Perkin Elmer- 
Cerus); 2.5 pi Kentaq buffer (Clontech); 0.25 pi forward oligo 1 ; 0.25 pi reverse oligo 2; 12.5 pi distilled water. 
The sequence of the forward oligonucleotide 1 was: 

5'-TGTAAAACGACGGCCAGT TAAATAGACCTGCAATTATTAATCT- 3' (SEQ ID NO:3) 
The sequence of reverse oligonucleotide 2 was: 

5'-CAGGAAACAGCTATGACC ACCTGCACACCTr;r AAATCCATT- 3' (SEQ ID NO:4) 
PCR was then performed as follows: 

Denature 92 0 C f 5 minutes 

b. 3 cycles of: Denature 92 °C, 30 seconds 
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Anneal 
Extend 



59°C, 30 seconds 
72 °C. 60 seconds 



c. 



3 cycles of: 



Denature 

Anneal 

Extend 



92°C, 30 seconds 
57 °C, 30 seconds 
72 °C, 60 seconds 



5 



d. 



25 cycles of: 



Denature 

Anneal 

Extend 



92°C, 30 seconds 
55 °C, 30 seconds 
72°C, 60 seconds 



10 



e. 



Hold 



4°C 



The underlined regions of the oligonucleotides annealed to the ADH promoter region and the amylase 
region, respectively, and amplified a 307 bp region from vector pSST-AMY.O when no insert was present. 
15 Typically, the first 18 nucleotides of the 5' end of these oligonucleotides contained annealing sites for the 
sequencing primers. Thus, the total product of the PCR reaction from an empty vector was 343 bp. However, 
signal sequence-fused cDNA resulted in considerably longer nucleotide sequences. 

Following the PCR, an aliquot of the reaction (5 fi\) was examined by agarose gel electrophoresis in 
a 1 % agarose gel using a Tris-Borate-EDTA (TBE) buffering system as described by Sambrook et al., s upra. 
20 Clones resulting in a single strong PCR product larger than 400 bp were further analyzed by DNA sequencing 
after purification with a 96 Qiaquick PCR clean-up column (Qiagen Inc., Cnatsworth, CA). 

EXAMPLE 3 : Isolation of cDNA Clones Using Signal Algorithm Analysis 



25 sequence finding algorithm developed by Genentech, Inc. (South San Francisco, CA) upon ESTs as well as 
clustered and assembled EST fragments from public (e.g., GenBank) and/or private (LIFESEQ*, lncyte 
Pharmaceuticals, Inc., Palo Alto, CA) databases. The signal sequence algorithm computes a secretion signal 
score based on the character of the DNA nucleotides surrounding the first and optionally the second methionine 
codon(s) (ATG) at the 5'-end of the sequence or sequence fragment under consideration. The nucleotides 

30 following the first ATG must code for at least 35 unambiguous amino acids without any stop codons. If the first 
ATG has the required amino acids, the second is not examined. If neither meets the requirement, the candidate 
sequence is not scored. In order to determine whether the EST sequence contains an authentic signal sequence, 
the DNA and corresponding amino acid sequences surrounding the ATG codon are scored using a set of seven 
sensors (evaluation parameters) known to be associated with secretion signals. Use of this algorithm resulted 

35 in the identification of numerous rx>lypeptide-encoding nucleic acid sequences. 

EXAMPLE 4: Isolation of cDNA clones En coding Human PRQ7R1 

In order to obtain a cDNA clone encoding PR028I, methods described in Klein et al., Proc. Natl. 
Acad. Sci. USA 93:7108-7113 (1996) were employed with the following modifications. Yeast transformation 
40 was performed with limiting amounts of transforming DNA in order to reduce the number of multiple 
transformed yeast cells. Instead of plasmid isolation from the yeast followed by transformation of E. coli as 



Various polypeptide-encoding nucleic acid sequences were identified by applying a proprietary signal 
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described in Klein et al., supra . PCR analysis was performed on single yeast colonies. PCR primers employed 
were bipartite in order to amplify the insert and a small portion of the invenase gene (allowing to determine that 
the insert was in frame with invenase) and to add on universal sequencing primer sites. 

An invenase library was transformed into yeast and positives were selected on sucrose plates. Positive 
clones were re-tested and PCR products were sequenced. The sequence of one clone, PR0281 , was determined 
5 to contain a signal peptide coding sequence. Oligonucleotide primers and probes were designed using the 
nucleotide sequence of PR0281. A full length plasmid library of cDNAs from human umbilical vein 
endothelium tissue was titered and approximately 100,000 cfu were plated in 192 pools of 500 cfu/poo) into 96- 
well round bottom plates. The plates were sealed and pools were grown overnight at 37°C with shaking 
(200rpm). PCR was performed on the individual cultures using primers. Agarose gel electrophoresis was 
1 0 performed and positive wells were identified by visualization of a band of the expected size. Individual positive 
clones were obtained by colony lift followed by hybridization with JJ P-labeled oligonucleotide. These clones 
were characterized by PCR, restriction digest, and southern blot analyses. 

A full length clone was identified that contained a single open reading frame with an apparent 
translations initiation site at nucleotide positions 80-82, and a srop signal at nucleotide positions 1115-1117 
15 (Figure 1, SEQ ID NO:l). The predicted polypeptide precursor is 345 amino acids long, has a calculated 
molecular weight of approximately 37,205 daltons and an estimated pi of approximately 10. 15. Analysis of the 
full-length PR0281 sequence shown in Figure 2 (SEQ ID NO:2) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 14, multiple transmembrane domains from about 
amino acid position 83 to about amino acid position 105, from about amino acid position 126 to about amino acid 
20 position 146, from about amino acid position 158 to about amino acid position 177, from about amino acid 
position 197 to about amino acid position 216, from about amino acid position 218 to about amino acid position 
238, from about amino acid position 245 to about amino acid position 265, and from about amino acid position 
271 to about amino acid position 290 and an amino acid sequence block having homology to G-protein coupled 
receptor proteins from about amino acid 1 15 to about amino acid 155. Clone UNQ244 (DNA 16422- 1209) has 
25 been deposited with ATCC on June 2, 1998 and is assigned ATCC deposit no. 209929. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 2 (SEQ ID NO:2), evidenced significant 
homology between the PR0281 amino acid sequence and the following Dayhoff sequences: H64634, 
AF033095J, B64815, YBHLECOLI, EMEQUTRJ, AF064763JJ, S53708, A69253, AF035413J2 and 
30 S63281. 



EXAMPLE 5: Isolation of c DNA clones Encoding Human PRQ276 

In order to obtain a cDNA clone encoding PR0276, methods described in Klein et al. . PNAS . 93:7108- 
7113 (1996) were employed with the following modifications. Yeast transformation was performed with limiting 
amounts of tr^f^ming DNA in order to reduce the number of multiple transformed yeast cells. Instead of 
plasmid isolation from the yeast followed by transformation of E. coli as described in Klein et al., supia. PCR 
analyse was performed on single yeast colonies. PCR primers employed were bipartite in order to amplify the 
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insert and a small portion of the invertase gene (allowing to determine that the insert was in frame with invertase) 
and to add on universal sequencing primer sites. 

An invertase library was transformed into yeast and positives were selected on sucrose plates. Positive 
clones were re-tested and PCR products were sequenced. The sequence of one clone, PR0276, was determined 
to contain a signal peptide coding sequence. Oligonucleotide primers and probes were designed using the 
nucleotide sequence of PR0276. A full length plasmid library of cDNAs from human fetal liver cells was titered 
and approximately 100,000 cfu were plated in 192 pools of 500 cfu/pool into 96-weIl round bottom plates. The 
plates were sealed and pools were grown overnight at 37 C with shaking (200rpm). PCR was performed on the 
individual cultures using primers. Agarose gel electrophoresis was performed and positive wells were identified 
by visualization of a band of the expected size. Individual positive clones were obtained by colony lift followed 
by hybridization with 32 P-labeled oligonucleotide. These clones were characterized by PCR, restriction digest, 
and southern blot analyses. 

A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 180-182 and a stop signal at nucleotide positions 933-935 
(Figure 3; SEQ ID NO:5). The predicted polypeptide precursor is 251 amino acids long has a calculated 
molecular weight of approximately 28,801 daltons and an estimated pi of approximately 9.58. The 
transmembrane domains are approximately at amino acids 98-1 16 and 152-172 of the sequence shown in Figure 
4 (SEQ ID NO:6). Clone DNA16435-1208 (UNQ243) has been deposited with the ATCC and is assigned 
ATCC deposit no. 209930 . 

At analysis of the Dayhoff database (version 35.45 Swiss Prot 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 4 (SEQ ID NO: 6), revealed some sequence 
identity between the PR0276 amino acid sequence and the following Dayhoff sequences: CEG25D72, 
ATT805J2, S69696, GRHRRAT, NPCBAABCD3, AB013149J, P_R85942 and APO00006_5. 

EXAMPLE 6 : Isolation of cDNA clones Encoding Human PRQlftQ- 

A clone designated herein as DNA14187 was isolated as described in Example 2 above from a human 
retina tissue library. The DNAI4187 sequence is shown in Figure 7 (SEQ ID NO:9). Based on the DNA14187 
sequence shown in Figure 7 (SEQ ID NO:9), oligonucleotides were synthesized: 1) to identify by PCR a cDNA 
library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding 
sequence for PRO 189. Forward and reverse PCR primers generally range from 20 to 30 nucleotides and are 
often designed to give a PCR product of about 100- 1000 bp in length. The probe sequences are typically 40-55 
bp in length. In order to screen several libraries for a full-length clone, DNA from the libraries was screened 
by PCR amplification, as per Ausubel et al., Current Protocols in Molecular Biology, with the PCR primer pair. 
A positive library was then used to isolate clones encoding the gene of interest using the probe oligonucleotide 
and one of the primer pairs. 

A pair of PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 '-TTGACCTATAC AG AG ATTCATC-3 ' (SEQ ID NO: 10); and 
reverse PCR primer S'-CTAAGAACTTCCCTCAGGATTTTO' (SEQ ID NO: II). 
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Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA14187 sequence 
which had the following nucleotide sequence: 
hybridization probe 

5 * -ATG AAG ATC AATTTC AAG AAGC ATGC ACTTCTCCTCTTGC-3 ' (SEQ ID NO: 12). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
5 screened by PCR amplification with the PGR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 189 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human retina tissue (LIB94). The cDNA 
libraries used to isolate the cDNA clones were constructed by standard methods using commercially available 
reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT containing a NotI 
10 site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector {such as pRJCB or pRKD; 
pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et ah, Science . 253 : 1278-1280 
(1991)) in the unique Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
15 PR0189 and the derived protein sequence for PRO 189. 

The entire nucleotide sequence of DNA21624-1391 is shown in Figure 5 (SEQ ID NO:7). Clone 
DNA21 624- 1391 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 200-202 and ending at the stop codon at nucleotide positions 1301-1303 (Figure 5). The predicted 
polypeptide precursor is 367 amino acids long (Figure 6). The full-length PRO 1 89 protein shown in Figure 6 
20 has an estimated molecular weight of about 41,871 daltons and a pi of about 5.06. Clone DNA21624-1391 has 
been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains the 
correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO:8, the putative N-glycosylation sites are at about 
amino acids 224-227, 246-249 and 285-288. A domain for cytosolic fatty-acid binding proteins is at amino acids 
25 78-107 of SEQ ID NO:8. The corresponding nucleotides can be routinely determined given the sequences 
provided herein. 

Some sequence identity was found to W01A6.1 and F35Dll.ll, C. Elegans proteins, designated in a 
Dayhoff database as CEW01 A6_10 and CELF35D1 1 J I , respectively. Some sequence identity was also found 
to an antigen to malaria and to restin, designated in a Dayhoff database as P_R05766 and AF014012J, 
30 respectively. Some sequence identity was also found to a microtubule binding protein and to myosin, designated 
in a Dayhoff database as AF041382_I and S07537, respectively. There is also some sequence identity with 1- 
phosphatidylinositol-4, 5-bisphosphate, designated as PIP1 RAT. 

EXAMPLE 7 : Isolation of cDNA clones Encodine Human PRO 190 
35 A clone designated herein as DNA 14232 was isolated as described in Example 2 above from a human 

fetal retina tissue library. The DNA14232 sequence is shown in Figure 10 (SEQ ID NO: 15). Based on the 
DNA14232 sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
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the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO 190. Forward and reverse PCR primers generally range from 20 to 30 nucleotides and are often designed 
to give a PCR product of about 100-1000 bp in length. The probe sequences are typically 40-55 bp in length. 
In order to screen several libraries for a full-length clone, DNA from the libraries was screened by PCR 
amplification, as per Ausubel et al., Current Protocols in Molecular Biology, with the PCR primer pair. A 
5 positive library was then used to isolate clones encoding the gene of interest using the probe oligonucleotide and 
one of the primer pairs. 

A pair of PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5*-CTATACCTACTGTAGCTTCT-3' (SEQ ID NO: 16); and 
reverse PCR primer 5'-TCAGAGAATTCCTTCCAGGA-3' (SEQ ID NO: 17). 
10 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA 14232 sequence 
which had the following nucleotide sequence: 
hybridization probe 

5 ' -ACAGTGCTGTAGTCATCCTGTAATATGCTCCTTGTC AAC A-3 ' (SEQ ID NO:18). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
15 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 190 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human retina tissue (LIB94). The cDNA 
libraries used to isolate the cDNA clones were constructed by standard methods using commercially available 
reagents such as those from Invitrogen, San Diego, C A. The cDNA was primed with oligo dT containing a NotI 
20 site, linked with blunt to Sail hemikinased adaptors, cleaved with Notl, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; 
pRK5B is a precursor of pRK5D that does not contain the SHI site; see, Holmes et al., Science . 253: 1278-1280 
(1991)) in the unique Xhol and Notl sites. 

DNA sequencing of the clones isolated as described above gave sequences which include the full-length 
25 DNA sequence for PRO190 [herein designated as DNA23334- 1392] (SEQ ID NO: 13) and the derived protein 
sequence for PRO190. 

The entire nucleotide sequence of DNA23334-1392 is shown in Figure 8 (SEQ ID NO: 13). Clone 
DNA23334-1392 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 193-195 and which ends at the stop codon at nucleotide positions 1465-1467 (Figure 8). The predicted 

30 polypeptide precursor is 424 amino acids long (Figure 9). The full-length PRO 190 protein shown in Figure 9 
has an estimated molecular weight of about 48,500 daltons and a pi of about 8.65. Clone DNA23334-I392 has 
been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains the 
correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO: 14, the putative transmembrane domains are at about 

35 amino acids 16-36, 50-74, 147-168, 229-250, 271-293, 298-318 and 328-368 of SEQ ID NO: 14. N- 
glycosylation sites are at about amino acids 128-131. 204-207, 218-221 and 274-377 of SEQ ID NO: 14. The 
corresponding nucleotides can be routinely determined given the sequences provided herein. 
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PRO 190 has sequence identity with at least the following Dayhoff sequences designated as: 
CEZK896 2, JC5023, GMS1_SCHP0 and S44668. 

EXAMPLE 8 : isolation of cDNA clones Encoding Human PRQ34I 

A clone designated herein as DNA 12920 was isolated as described in Example 2 above from a human 
5 placenta tissue library. The DNA12920 sequence is shown in Figure 13 (SEQ ID NO:21). The DNA12920 
sequence was then compared to various EST databases including public EST databases (e.g., GenBank), and a 
proprietary EST database (LiFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST 
sequences. The comparison was performed using the computer program BLAST or BLAST2 [Altschul et al.. 
Methods in Enzvmologv. 266:460480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in 

10 some cases, 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). This 
consensus sequence is herein designated DNA25314. Oligonucleotide primers based upon the DNA253I4 
sequence were then synthesized and employed to screen a human placenta cDNA library which resulted in the 
identification of the DNA26288-1239 clone shown in Figure 1 1 . The cloning vector was pRKSB (pRK5B is a 

15 precursor of pRK5D that does not contain the Sfil site; see, Holmes et ai., Science . 253:1278-1280 (1991)), and 
the cDNA size cut was less than 2800 bp. 

A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 380-382, and a stop signal at nucleotide positions 1754-1756 
(Figure 1 1 , SEQ ID NO: 19). The predicted polypeptide precursor is 458 amino acids long, has a calculated 

20 molecular weight of approximately 50,264 daltons and an estimated pi of approximately 8. 17. Analysis of the 
full-length PR0341 sequence shown in Figure 12 (SEQ ID NO:20) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 17. transmembrane domains from about amino acid 
171 to about amino acid 190, from about amino acid 220 to about amino acid 239, from about amino acid 259 
to about amino acid 275, from about amino acid 286 to about amino acid 305, from about amino acid 316 to 

25 about amino acid 335, from about amino acid 353 to about amino acid 378 and from about amino acid 396 to 
about amino acid 417 and potential N-glycosylation sites from about amino acid 145 to about amino acid 147 
and from about amino acid 155 to about amino acid 158. Clone DNA26288-1239 has been deposited with 
ATCC on April 21, 1998 and is assigned ATCC deposit no. 209792. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

30 alignment analysis of the full-length sequence shown in Figure 12 (SEQ ID NO:20), evidenced homology 
between the PR0341 amino acid sequence and the following Dayhoff sequences: S75696, H69788, D69852, 
A69888, B64918, F64752, LPU89276J, G64962, S52977 and S44253. 

EXAMPLE 9: Isolation of cDNA clones E ncoding Human PRO180 
35 A clone designated herein as DNA 12922 was isolated as described in Example 2 above from a human 

placenta tissue library. The DNA 12922 sequence is shown in Figure 16 (SEQ ID NO:24). The DNA12922 
sequence was then compared to various EST databases including public EST databases (e.g.. GenBank), and a 
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proprietary EST database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST 
sequences. The comparison was performed using the computer program BLAST or BLAST2 [Ahschul et aL, 
Methods in Enzvmology. 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in 
some cases, 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). 
5 An oligonucleotide probe was formed based upon the consensus sequence obtained above. This probe 

had the following sequence. 

S'-ACCTGTTAGAAATGTGGTGGTTTCAGCAAGGCCTCAGTTT (SEQ ID NO:25). 

This probe was used to screen a human placenta library prepared as described in paragraph 1 of Example 2 

above. The cloning vector was pRK5B (pRK5B is a precursor of pRK5D thai does not contain the Sfil site; see, 
10 Holmes et al., Science . 253:1278-1280 (1991)), and the cDNA size cut was less than 2800 bp. A clone 

designated herein as DNA26843-1389 was obtained. 

The entire nucleotide sequence of DNA26843-1389 is shown in Figure 14 (SEQ ID NO:22). Clone 

DNA26843-1389 contains a single open reading frame with an apparent translational initiation site at nucleotide 

positions 121-123 and ending at the stop codon at nucleotide positions 919-921 (Figure 14). The predicted 
15 polypeptide precursor is 266 amino acids long (Figure 15). The full-length FRO180 protein shown in Figure 

15 has an estimated molecular weight of about 29,766 daltons and a pi of about 8.39. Clone DNA26843-1389 

has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 

the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Still analyzing the amino acid sequence of SEQ ID NO:23, the transmembrane dornains are at about 
20 amino acids 13-33 (type II), 54-73, 94-113, 160-180 and 122-141 of SEQ ID NO:23. N-myristoylation sites 

are at about amino acids 57-62, 95-100, 99-104, 124-129 and 183-188 of SEQ ID NO:23. The corresponding 

nucleotides can be routinely determined given the sequences provided herein. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 15 (SEQ ID NO:23), evidenced some sequence 
25 identity between the PRO180 amino acid sequence and the following Dayhoff sequences: CEC33A11_2, 

CEG11E6_5, CELW03A5_1 AND PEU83861 2 (NADH dehydrogenase subunit 4L, mitochondrion). 



EXAMPLE 10 : Isolation of cDNA clones Encoding Human PRO 194 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

30 in Example 1 above. This consensus sequence is herein DNA19464. Based on the DNA19464 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 
of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO 194. PCR 
primers (forward and reverse) were synthesized based upon the DNA 19464 sequence. Additionally, a synthetic 
oligonucleotide hybridization probe was constructed from the consensus DNA 19464 sequence. 

35 to order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 194 gene using the probe oligonucleotide and one of the PCR primers. RNA 
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for construction of the cDNA libraries was isolated from human fetaJ lung tissue (LIB25). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0194 [herein designated as DNA26844-1394] (SEQ JD NO:27) and the derived protein sequence for 
PR0194. 

The entire nucleotide sequence of DNA26844-1394 is shown in Figure 17 (SEQ ID NO:27). Clone 
5 DNA26844-1394 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 81-83 and ending ai the stop codon at nucleotide positions 873-875 (Figure 17). The predicted 
polypeptide precursor is 264 amino acids long (Figure 18). The full-length PR0194 protein shown in Figure 
18 has an estimated molecular weight of about 29,665 daltons and a pi of about 9.34. Analysis of the full-length 
PR0194 sequence shown in Figure 18 (SEQ ID NO:28) evidences the presence of various important 
10 polypeptides domains as shown in Figure 18. Clone DNA26844-1394 has been deposited with ATCC on June 
2, 1998 and is assigned ATCC deposit no. 209926. 

Analysis of the amino acid sequence of the full-length PRO 194 polypeptide suggests that it does not 
exhioi: significant sequence similarity to any known human protein. However, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some homology between the PR0194 amino acid sequence and 
15 the following Dayhoff sequences, HUMORFTJ, CET07F10J, ATFCA9J2, F64934, YDJX_ECOU, 
ATAF00065719F29G20.19, H70002, S76980. H64934 and S76385. 

EXAMPLE 1 1 : isolation of cDNA clones Encoding Human PRO203 

A clone designated herein as DNA15618 was isolated as described in Example 2 above from a human 
20 fetal lung tissue library. The DNA15618 sequence is shown in Figure 21 (SEQ ID NO:31). Oligonucleotide 
probes were generated from the sequence of the DNA 156 1 8 molecule and were used to screen a human fetal lung 
library (LIB26) prepared as described in paragraph 1 of Example 2 above. The cloning vector was pRK5B 
(pRKSB is a precursor of pRK5D that does not contain the Sfil she; see, Holmes et ai., Science . 253: 1278-1280 
(1991)), and the cDNA size cut was less than 2800 bp. 
25 A full length clone was identified that contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 159-161 and ending at the stop codon found at nucleotide 
positions 1200-1202 (Figure 19; SEQ ID NO:29). The predicted polypeptide precursor is 347 amino acids long, 
has a calculated molecular weight of approximately 39,870 daltons and an estimated pi of approximately 6.76. 
Analysis of the full-length PRO203 sequence shown in Figure 20 (SEQ ID NO:30) evidences the presence of 
30 the following: a type II transmembrane domain at about amino acid 64 to about amino acid 87; possible N- 
glycosylation sites at about amino acid 147 to about amino acid 150, about amino acid 155 to about amino acid 
158, and about amino acid 237 to about amino acid 240; sequence identity with heavy -metal -associated domain 
proteins at about amino acid 23 to about amino acid 45, and sequence identity with D-isomer specific 2- 
hydroxyacid dehydrogenase at about amino acid 24 to about amino acid 34. Clone DNA30862-1396 was 
35 deposited with the ATCC on June 2, 1998, and is assigned ATCC deposit no. 209920. 

Analysis of the amino acid sequence of the full-length PRO203 polypeptide suggests that it possesses 
sequence similarity to GST ATPase, thereby indicating that PRO203 may be a novel GST ATPase. More 
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specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between the 
PRO203 amino acid sequence and the following Dayhoff sequences, AF008124_1, CFRCDIGEN1, and 
P_R82566. 

EXAMPLE 12 : Isolation of cDNA clones Encoding Human PRO290 

An expressed sequence lag (EST) DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) 
was searched and an EST was identified that had homology to beige and FAN. An oligonucleotide probe based 
upon the identified EST sequence was then synthesized and used to screen human fetal kidney cDNA libraries 
in an attempt to identify a full-length cDNA clone. The oligonucleotide probe had the following sequence: 
5' TGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGCTG 3' (SEQ ID NO:34). 

RNA for construction of cDNA libraries was isolated from human fetal kidney tissue. The cDNA 
libraries used to isolate the cDNA clones encoding human PRO290 were constructed by standard methods using 
commercially available reagents such as those from Invitrogen, San Diego. CA. The cDNA was primed with 
oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized 
appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as 
pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see , Holmes et al. , Science 
253:1278-1280 (1991)) in the unique Xhol and NotI. 

A cDNA clone was identified and sequenced in entirety. The entire nucleotide sequence of DNA35680- 
1212 is shown in Figure 22 (SEQ ID NO:32). Clone DNA35680- 121 2 contains a single open reading frame with 
an apparent translation^ initiation site at nucleotide positions 293-295, and a stop codon at nucleotide positions 
3302-3304 (Figure 22; SEQ ID NO:32). The predicted polypeptide precursor is 1003 amino acids long. 

It is currently believed that the PRO290 polypeptide is related to FAN and/or beige. Clone DNA35680- 
1212 has been deposited with ATCC and is assigned ATCC deposit no. 209790. It is understood that the 
deposited clone has the actual correct sequence rather than the representations provided herein. The full-length 
PRO290 protein shown in Figure 23 has an estimated molecular weight of about 112,013 daltons and a pi of 
about 6.4. 

EXAMPLE 13: Isolation of cDNA Clones Encoding Human PRQ874 

A consensus DNA sequence designated herein as DNA36459 was identified using phrap as described 
in Example 1 above. Based on the DNA36459 consensus sequence, oligonucleotides were synthesized: 1) to 
identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a 
clone of the coding sequence for PR0874. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 ' -TCGTGCCCAGGG GCTGATGTGC-3 ' (SEQ ID NO:37); and 
reverse PCR primer 5 ' -GTCTTTACCC AGCCCCGGG ATGCG-3 ' (SEQ ID NO:38). 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA36459 
sequence which had the following nucleotide sequence: 
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hybridization probe 

5'-GGCCrAATCCAACGTTCTGTCITCAATCTGCAAATCTATGGGGTCCTGGG-3' (SEQ ID NO:39). 

In order 10 screen several libraries for a source of a clone, DNA from ihe libraries was screened by 
PCR amplification with the PGR primer pair identified above. A positive library was then used to isolate clones 
encoding the PR0874 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction 
of the cDNA libraries was isolated from human fetal lung tissue (LIB25). 

DNA sequencing of the clones isolated as described above gave the DNA sequence for PR0874 [herein 
designated as DNA4062 1-1440] (SEQ ID NO:35) and the derived protein sequence for PR0874. 

The entire nucleotide sequence of DNA4O62I-1440 is shown in Figure 24 (SEQ ID NO:35). Clone 
DNA4062M440 contains a single open reading frame ending at the stop codon at nucleotide positions 964-966 
(Figure 24). The predicted polypeptide encoded by DNA40621- 1440 is 321 amino acids long (Figure 25). The 
PR0874 protein shown in Figure 25 has an estimated molecular weight of about 36, 194 daltons and a pi of about 
9.85. Analysis of the PR0874 sequence shown in Figure 25 (SEQ ID NO:36) evidenced the presence of the 
following: a type II transmembrane domain at about amino acids 57-80; additional transmembrane domains at 
about -wno acids 1 10-126, 215-231, and 254-274; potential N-glycosylation sites at about amino acids 16-19, 
27-30, and 289-292; sequence identity with hypothetical YBR002c family proteins at about amino acids 276-287; 
and sequence identity with ammonium transporter proteins at about amino acids 204-230. Clone DNA40621- 
1440 was deposited with the ATCC on June 2, 1998, and is assigned ATCC deposit no. 209922. 

Analysis of the amino acid sequence of the PR0874 polypeptide suggests that it is a novel multi-span 
transmembrane protein. However, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
sequence identity between the PR0874 amino acid sequence and the following Dayhoff sequences: S67049, 
AF054839J, S73437, S52460, and HIVU80570_1. 

EXAMPLE 14 : Isolation of cDNA Clones Encoding Human PRO710 

A yeast screening assay was employed to identify cDNA clones thai encoded potential secreted proteins. 
Use of this yeast screening assay allowed identification of a single cDNA clone whose sequence (herein 
designated as DNA38190) is shown in Figure 28 (SEQ ID NO:42). Based on the DNA38190 sequence shown 
in Figure 28, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the 
sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO710. 
In order to screen several libraries for a full-length clone, DNA from the libraries was screened by PCR 
amplification, as per Ausubel et ah, Current Prot ocols in Molecular Biology, with the PCR primer pair. A 
positive library was then used to isolate clones encoding the gene of interest using the probe oligonucleotide and 
one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 ' -TTCCGCA AAGAGTTCT ACG AGGTGG-3 ' (SEQ ID NO:43) 
reverse PCR primer 5 ' - ATTGAC AAC ATTG ACTG GC CT ATGGG - 3 ' (SEQ ID NO:44) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA38190 sequence 
which had the following nucleotide sequence 
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hybridization probe 

5'-GTGGATGCTCTGTGTGCGTGCAAGATCCTTCAGGCCTTGTTCCAGTGTGA-3' (SEQ ID NO:45) 

In order to screen several libraries for a source of a fuJl-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO710 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB227). 
The cDNA libraries used to isolate the cDNA clones were constructed by standard methods using commercially 
available reagents such as those from Invirrogen, San Diego, CA. The cDNA was primed with oligo dT 
containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately 
by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or 
pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see. Holmes et al. t Science . 
253 : 1278-1280 (1991)) in the unique Xho] and NotI sites. 

A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 67-69 and ending at the stop codon found at nucleotide positions 
; 7*5- 1767 (Figure 26, SEQ ID NO:40). The predicted polypeptide precursor is 566 amino acids long, has a 
calculated molecular weight of approximately 65,555 daltons and an estimated pi of approximately 5.44. 
Analysis of the full-length PRO710 sequence shown in Figure 27 (SEQ ID NO:41) evidences the presence of 
the following: a signal peptide from about amino acid 1 to about amino acid 32, a transmembrane domain from 
about amino acid 454 to about amino acid 476, an aminoacyl -transfer RNA synthetase class-II signature sequence 
from about amino acid 6 to about amino acid 26 and potential N-glycosylation sites from about amino acid 1 1 1 
to about amino acid 1 14, from about amino acid 146 to about amino acid 149 and from about amino acid 292 
to about amino acid 295. Clone DNA44 16 1-1434 has been deposited with ATCC on May 27, 1998 and is 
assigned ATCC deposit no. 209907. 

Analysis of the amino acid sequence of the full-length PRO710 polypeptide suggests that it possesses 
significant sequence similarity to the CDC45 protein, thereby indicating that PRO710 may be a novel CDC45 
homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PRO710 amino acid sequence and the following Dayhoff sequences, 
HSAJ3728J, CEF34D10J, S64939, UMU50276J, TRHY SHEEP, CELT14E8J, RNA1_YEAST, 
LVU89340J, HSU80736J and CEZK337_2. 

EXAMPLE 15: Isolation of cDNA clones E ncoding Human PROU51 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA40665. Based on the DNA40665 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01151. 
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PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5*-CCAGACGCTGCTCTTCGAAAGGGTC-3' (SEQ ID NO:48) 
reverse PCR primer 5'-GGTCCCCGTAGGCCAGGTCCAGC-3' (SEQ ID NO:49) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40665 
sequence which bad the following nucleotide sequence 
5 hybridization probe 

S'-CTACTTCTTCAGCCTCAATGTGCACAGCTGGAATTACAAGGAGACGTACGO* (SEQ ID NO:50) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROH51 gene using the probe oligonucleotide and one of the PCR primers. RNA 
10 for construction of the cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROl 151 (designated herein as DNA44694-1500 fFigure 29, SEQ ID NO:46]; and the derived protein sequence 
for PROl 151. 

The entire nucleotide sequence of DNA44694-1500 is shown in Figure 29 (SEQ ID NO:46). Clone 
1 5 DNA44694- 1500 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 272-274 and ending at the stop codon at nucleotide positions 1049-1051 (Figure 29). The predicted 
polypeptide precursor is 259 amino acids long (Figure 30). The full-length PROl 151 protein shown in Figure 
30 has an estimated molecular weight of about 28,770 daltons and a pi of about 6. 12. Analysis of the full-length 
PROl 151 sequence shown in Figure 30 (SEQ ID NO:47) evidences the presence of the following: a signal 
20 peptide from about amino acid 1 to about amino acid 20, a potential N-glycosyiation site from about amino acid 
72 to about amino acid 75 and amino acid sequence blocks having homology to Clq domam^ntaining proteins 
from about amino acid 144 to about amino acid 178, from about amino acid 78 to about amino acid 1 1 1 and from 
about amino acid 84 to about amino acid 117. Clone UNQ581 (DNA44694-1500) has been deposited with 
ATCC on August 11, 1998 and is assigned ATCC deposit no. 203114. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 30 (SEQ ID NO:47), evidenced significant 
homology between the PROl 15 1 amino acid sequence and the following Dayhoff sequences: ACR3 HUMAN, 
HP25_TAMAS, HUMCIQB2J, P_R99306\ CA1FHUMAN, JX0369, CA24_HUMAN, S32436, P_R28916 
and CA54 HUMAN. 



EXAMPLE 16: Isolation of cDNA clones Encoding Human PRO 1282 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein as DNA33778. Based on theDNA33778 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of mterest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01282. 
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PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'TCTTC.AaccnrTTnrnr a Arrrrr ( S £Q ID NO:53); and 
reverse PCR primer S'TTGCTCACATPP AnnrnnrAnny (SEQ ID NO:54). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA33778 sequence which had the following nucleotide sequence: 
5 hybridization probe 

5 'TGG ATGTTGTCC AGAC AACC AGCTGGAGCTGTATCCGAGGC3 * (SEQ ID NO:55). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 1282 gene using the probe oligonucleotide and one of the PCR primers. RNA 
10 for construction of the cDNA libraries was isolated from human fetal liver. 

DNA sequencing of the clones isolated as described above gave the full -length DNA sequence for 
PRO 1282 (designated herein as DNA45495- 1550 [Figure 3 1 . SEQ ID NO:5 1 ] ; and the derived protein sequence 
for PR01282. 

The entire coding sequence of PRO 1282 is shown in Figure 3 1 (SEQ ID NO:5I). Clone DNA45495- 
15 1550 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
120-122, and an apparent stop codon at nucleotide positions 2139-2141 (SEQ ID NO:51). The predicted 
polypeptide precursor is 673 amino acids long. The signal peptide is at about amino acids 1-23; the 
transmembrane domain is at about amino acids 579-599; an EGF-like domain cysteine pattern signature starts 
at about amino acid 430; and leucine zipper patterns start at about amino acids 197 and 269 of SEQ ID NO:52, 
20 see Figure 32. Clone DNA45495-I550 has been deposited with the ATCC and is assigned ATCC deposit no. 
203156. The full-length PR01282 protein shown in Figure 32 has an estimated molecular weight of about 
71,655 daltons and a pi of about 7.8. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 32 (SEQ ID NO:52), revealed sequence identity 
25 between the PR01282 amino acid sequence and the following Dayhoff sequences (data from database 
incorporated by reference): AB007876J, RNPLGPVJ, MUSLRRPJ, ALS_PAPPA. AC004142J, 
ALSHUMAN, AB014462J , DMTARTANJ , HSCHON03J and S46224. 

EXAMPLE 17: Isolation of c DNA clones Encoding Human PRQ35S 
30 Using the method described in Example 1 above, a single EST sequence was identified in the Incyte 

database, designated herein as INC31 15949. Based on the INC31 15949 EST sequence, oligonucleotides were 

synthesized to identify by PCR a cDNA library that contained the sequence of interest and for use as probes to 

isolate a clone of the full-length coding sequence for PR0358. 

A pair of PCR primers (forward and reverse) were synthesized: 
35 forward PCR primer 5 '-TCCCACC AGGTATC AT A A ACTG AA-3 ' (SEQ ID NO:58) 

reverse PCR primer 5'-TTATAGACAATCTGTTCTCATCAGAGA-3' (SEQ ID N0:59) 
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A probe was also synthesized: 

5'-AAAAAGCATACTTGGAATGGCCCAAGGATAGGTGTAAATG-3' (SEQ ID NO:60) 

In order to screen several libraries for a source of a ful.-length clone. DNA from the libraries was 
screened by PGR amplification with the PGR primer pair identified above. A positive .ibrarv was then used to 
tsolate Cones ercoding the PR0358 gene using the pro be oligonuc.eotide and one of the PCR primers RNA 
5 for consuuetion of the cDNA Hbraries was isolated from human bone marrow (UB256). The cDNA libraries 
used to isolated the cDNA clones were constructed by standard methods using commercial* avaiiab.e reagents 
such as toe from Invitrogen, San Diego. CA. The cDNA was primed with oligo dT containing a Not! site 
linked with blunt to Sal, hennWd adaptors, cleaved wttn Notl. si*ed appropriately by ge! electrophoresis' 
- dcl ™^a<fcf""dori^ 

10 of pRK5D that does not conuin the Sfi, site; see. Holmes e, a,.. Science 253:1278-1280 (.991)) in the unique 

Xhol and Notl sites. 

DNA sequencing of the clones isolated as described above gave the fuhMength DNA sequence for 
PK0358 (Figure 33. SEQ ,D NO:56) and the derived protein sequence for PR0358 (Figures 34. SEQ ,D 

15 The entire nucleotide sequence of the clone identified (DNA47361-1154) is shown in Figure 33 (SEQ 

ID NO:56). Clone DNA47361-U54 contains a single open reading frame with an apparent National 
mmauon site (ATG star, signal) at nucleotide positions underhned in Figure 33. Tne predicted po.ypeptide 
precursor is 81 1 amino acids long, including a putative signa, sequence (amino acids 1 to 19). an „,racel,ular 
domatn (amino acids 20 to 575. including leucine rich repeats in the reg.on from pos.tion 55 to position 575) 

20 a putative transmembrane domain (amino acids 576 to 595). Clone DNA4736 M249 has been deposited with 
ATCC and is assigned ATCC deposit no. 209431. 

EXAM£LEi8: I solalion of c f)NA clone. Fn^n,, M „„. n prmn[f , 

25 in E t 7T" SeqUenCC ^ re,aUVe IO ^ EST S ~ ^ * *«« 

» Sample 1 above. This consensus sequence is designated herein as DNA37164. Based on the DNA3 71 64 

census sequence, o.igonucleotides were seized: 1) to identify by PCR a cDNA library that contained 

^sequence of tnteres, and 2) for use as probes to isolate a .one of the futMengtb coding sequence for 

PCR primers (forward and reverse) were synthesized 

30 ta^ffiWs: 5'GTTCTCAATGAGCTACCCGTCCCC3' (SEQ ID NO:63) and 
XEVlUe^CR^^-CGCGATG^G^^^^^. (seq ^ ^ 

DNA473r UOnaIly ' 0Ug0nUCle ° ,ide hybridl " ,i0n Pr0be ™ — ' *» "* — 

DNA47394 sequence which had the following nucleotide sequence: 

hvbridizarinn p m>v- 

P w.th the PCR prtmer pan- .dentified above. A positive library was then used to 
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isolate clones encoding the PRO1310 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal liver tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PRO 13 10 and the derived protein sequence for PRO 13 10. 

The entire coding sequence of PRO1310 is shown in Figures 35A-B (SEQ ID NO:61). Clone 
DNA47394-1572 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 326-328, and an apparent stop codon at nucleotide positions 2594-2596 (SEQ ID NO:61). The 
predicted polypeptide precursor is 765 amino acids long. The signal peptide is at about amino acids 1-25 of SEQ 
ID NO:62. Clone DNA47394-1572 has been deposited with ATCC and is assigned ATCC deposit no. 203109. 
The full-length PR013 10 protein shown in Figure 36 has an estimated molecular weight of about 85,898 daltons 
and a pi of about 6.87. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 36 (SEQ ID NO;62), revealed sequence identity 
between the PRO 13 10 amino acid sequence and the following Dayhoff sequences: AF0I7639_1, P W36817, 
JC5256, CBPH_HUMAN, MMU23184 1, CBPN_HUMAN, HSU83411J. CEF01D4J7, RNU62897J and 
P_W11851. 

EXAMPLE 19: Isolation of cDNA Clones Encoding Human PRQ698 

A yeast screening assay was employed to identify cDN A clones that encoded potential secreted proteins . 
Use of this yeast screening assay allowed identification of a single cDNA clone whose sequence (herein 
designated as DNA39906) is shown in Figure 39 (SEQ ID NO:68). Based on the DNA39906 sequence shown 
in Figure 39, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the 
sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0698. 
in order to screen several libraries for a full-length clone, DNA from the libraries was screened by PCR 
amplification, as per Ausubel et ah, Cuirent Proto cols in Molecular Biology, with the PCR primer pair. A 
positive library was then used to isolate clones encoding the gene of interest using the probe oligonucleotide and 
one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 '-AGCTGTGGTC ATGGTGGTGTGGTG-3 ' (SEQ ID NO:69) 
reverse PCR primer 5'-CTACCTTGGCCATAGGTGATCCGC-3' (SEQ ID NO:70) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA39906 
sequence which had the following nucleotide sequence 
hybridization probe 

5 -C ATCAGC AA ACCGTCTGTGGTTC AGCTCAACTGG AGAGGGTT-3 ' (SEQ ID NO:71) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0698 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human bone marrow tissue (LIB255). The cDNA 
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libraries used to isolate the cDNA clones were constructed by standard methods using commercially available 
reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT containing a NoU 
site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; 
pRK5B is a precursor of pRK5D that does not contain the Sfil site; sec, Holmes et ah. Science . 253:1278-1280 
5 (1991)) in the unique Xhol and NotI sites. 

A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 14- 16 and ending at the stop codon found at nucleotide positions 
1544-1546 (Figure 37, SEQ ID NO:66). The predicted polypeptide precursor is 510 amino acids long, has a 
calculated molecular weight of approximately 57,280 daltons and an estimated pi of approximately 5.61. 
10 Analysis of the full-length PR0698 sequence shown in Figure 38 (SEQ ID NO:67) evidences the presence of 
the following: a signal peptide from about amino acid I to about amino acid 20, potential N-glycosylation sites 
from about amino acid 72 to about amino acid 75, from about amino acid 136 to about amino acid 139, from 
about amino acid 1 93 to about amino acid 196, from about amino acid 253 to about amino acid 256, from about 
amino acid 352 to about amino acid 355 and from about amino acid 41 1 to about amino acid 414 an amino acid 
15 block having homology to legume lectin beta-chain proteins from about amino acid 20 to about amino acid 39 
and an arnino acid block having homology to the HBGF/FGF family of proteins from about amino acid 338 to 
about amino acid 365. Clone DNA48320-1433 has been deposited with ATCC on May 27, 1998 and is assigned 
ATCC deposit no. 209904. 

Analysis of the amino acid sequence of the full-length PR0698 polypeptide suggests that it possesses 
20 significant sequence similarity to the olfactomedin protein, thereby indicating that PR0698 may be a novel 
olfactomedin homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between the PR0698 amino acid sequence and the following Dayhoff sequences, 
OLFMRANCA, 173637, AB006686S3J, RNU78105J, RNU72487J, P_R98225, CELC48E7j*' 
CEFI1C3J3, XLU85970_1 and S42257. 

25 

EXAMPLE 20: Isolation of c DNA Clones Fncoding Human PRQ712 

A yeast screening assay was employed to identify cDNA clones that encoded potential secreted proteins. 
Use of this yeast screening assay allowed identification of a single cDNA clone whose sequence (herein 
designated as DNA42580) is shown in Figure 45 (SEQ ID NO:77). The DNA42580 sequence was then 

30 compared to a variety of known EST sequences to identify homologies. The EST databases employed included 
public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte 
Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program BLAST or BLAST2 
(Altthul et al., Methods in Enzyme^ 266:46CM80 (1996)) as a comparison to a 6 frame translation of the EST 
sequence. Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not 

35 encode known proteins were clustered and assembled into consensus DNA sequences with the program -phrap" 
(Phil Green, University of Washington, Seattle, Washington). 
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Using the above analysis, a consensus DNA sequence was assembled relative to other EST sequences 
using phrap. This consensus sequence is herein designated consent) 1 . Proprietary Genentech EST sequences 
were employed in the consensus assembly and they are herein designated DNA20239 (Figure 42; SEQ ID 
NO:74), DNA38050 (Figure 43; SEQ ID NO:75) and DNA40683 (Figure 44; SEQ ID NO:76). 

Based on the consenOl sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA 
5 library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding 
sequence for PR0732. Forward and reverse PCR primers generally range from 20 to 30 nucleotides and are 
often designed to give a PCR product of about 100-1000 bp in length. The probe sequences are typically 40-55 
bp in length. In some cases, additional oligonucleotides are synthesized when the consensus sequence is greater 
than about I-1.5kbp. In order to screen several libraries for a full-length clone, DNA from the libraries was 
10 screened by PCR amplification, as per Ausubel et al., Current Protocols in Molecular Biology, with the PCR 
primer pair. A positive library was then used to isolate clones encoding the gene of interest using the probe 
oligonucleotide and one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer S'-ATGTTTGTnTnnA Larnrrrrg y (SE q m NO; 78) 
15 forward PCR primer 5'-GTCAACATCCTCCTCTGC-3' (SEQ ID NO:79) 

reverse PQR primer 5 -AATCC ATTGTGC ACTGC AGCTCT AGG-3 ' (SEQ ID NO:80) 
reverse PCR primer 5 '-GAGCATGCCACC ACTGGACTG AC-3 ' (SEQ ID NO:8l) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA44143 
sequence which had the following nucleotide sequence 
20 hybridization probe 

S'-GCCGATGCTGTCCTAGTGGAAACAACTCCACTGTAACTAGATTGATCTATGCAC-S* (SEQ ID 

NO:82) 

In order to screen several libraries for a source of a full-length clone. DNA from the libraries was 
screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used 
to folate clones encoding the PR0732 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human fetal lung tissue (LIB26). The 
cDNA libraries used to isolate the cDNA clones were constructed by standard methods using commercially 
available reagents such as those from Invitrogen, San Diego. CA. The cDNA was primed with oligo dT 
containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately 
30 by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or 
pRKD; pRK5B is a precursor of P RK5D that does not contain the Sfil site; see, Holmes et al., Science . 
253: 1278-1280 (1991)) in the unique Xhol and NotI sites. 

A full length clone was identified that contained a single open reading frame with an apparent 
translations initiation site at nucleotide poshions 88-90 and ending at the stop codon found at nucleotide positions 
35 1447-1449 (Figure 40, SEQ ID NO:72). The predicted poJypcptide precursor is 453 amino acids long, has a 
calculated molecular weight of approximately 50.419 daitons and an estimated P I of approximately 5.78. 
Analysis of the full-length PR0732 sequence shown in Figure 41 (SEQ ID NO:73) evidences the presence of 
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the following: a signal peptide from about amino acid 1 to about amino acid 28, transmembrane domains from 
about amino acid 37 to about amino acid 57, from about amino acid 93 to about amino acid 109, from about 
amino acid 126 to about amino acid 148, from aboui amino acid 151 to about amino acid 172, from about amino 
acid 197 to about amino acid 215, from about amino acid 231 to about amino acid 245, from about amino acid 
260 to about amino acid 279, from about amino acid 3 15 to about amino acid 333, from about amino acid 384 
to about amino acid 403 and from about amino acid 422 to about amino acid 447, potential N-glycosylation sites 
from about ammo acid 33 to about amino acid 36, from about amino acid 34 to about amino acid 37, from about 
amino acid 179 to about amino acid 183, from about amino acid 298 to about amino acid 301 , from about amino 
acid 337 to about amino acid 340 and from about amino acid 406 to about amino acid 409, an amino acid block 
having homology to the MIP family of proteins from about amino acid 119 to about amino acid 149 and an 
amino acid block having homology to DNA/RNA non-specific endonuclease proteins from about amino acid 279 
to about amino acid 286. Clone DNA48334-1435 has been deposited with ATCC on June 2, 1998 and is 
assigned ATCC deposit no. 209924. 

Analysis of the amino acid sequence of the full-length PR0732 polypeptide suggests that it possesses 
significant sequence similarity to the Diff33 protein, thereby indicating that PR0732 may be a novel DifD3 
homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PR0732 amino acid sequence and the following Dayhoff sequences. 
HS179M20 2, MUSTETUJ. CERIUM J, RATDRPJ, S51256, E69226, AE000869J, JC4120, 
CYB_ PARTE and P R50619. 

EXAMPLE 21 : Isolation of cDNA clones Encoding Human PRO I I2p 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein consen0352. The consen0352 sequence was 
then extended using repeated cycles of BLAST and phrap to extend the consensus sequence as far as possible 
using the sources of EST sequences discussed above. The extended consensus sequence is designated herein as 
DNA34365. Based on the DNA34365 consensus sequence, oligonucleotides were synthesized: 1) to identify 
by PCR i cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the 
full-length coding sequence for PROl 120. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primers: 5'-GAAGCCGGCTGTCTGAATC-3 ' (SEQ ID NO:85), 

5 -GGCCAGCTATCTCCGC AG-3 ' (SEQ ID NO:86), 5 AAGGGCCTGC AAG AGAAG-3 ' (SEQ ID NO:87), 

5 ' -CACTGGGACAACTGTGGG-3 ' (SEQ ID NO:88), 

5 ' -CAG AGGCAACGTGGAGAG-3 ' (SEQ ID NO:89), and 

5 - AAGT ATTGTC AT AC AGTGTTC- 3 1 (SEQ ID NO:90); 

reverse PCR primers : 5 -TAGTACTTGGGCACG AGGTTGG AG-3 ' (SEQ ID NO:91), and 5'- 
TCATACCAACTGCTGGTCATTGGC-3' (SEQ ID NO:92). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA34365 
consensus sequence which had the following nucleotide sequence: 
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hybridization probe : 

5 ' -CTC AAGCTGCTGG ACACGG AGCGGCCGGTGAATCGGTTTC ACTTG-3 ' (SEQ ID NO:93). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used 
to isolate clones encoding the PRO 1 120 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PRO1120 (designated herein as DNA4 8606- 1479 [Figures 46A-B, SEQ ID NO:83]; and the derived protein 
sequence for PROl 120. 

The entire coding sequence of PRO1120 is shown in Figures 46A-B (SEQ ID NO:83). Clone 
DNA48606-1479 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 608-610 and an apparent stop codon at nucleotide positions 3209-3211. The predicted polypeptide 
precursor is 867 amino acids long. The full-length PROl 120 protein shown in Figure 47 has an estimated 
molecular weight of about 100,156 Daltons and a pi of about 9.44. Additional features of the PROl 120 
polypeptide include a signal peptide at about amino acids 1-17; a sulfatase signature at about amino acids 86-98; 
regions of homology to sulfatases at about amino acids 87-106, 133-146, 216-229, 291-320, and 365-375; and 
potential N-glycosylation sites at about amino acids 65-68, 112-115, 132-135, 149-152, 171-174, 198-201,241- 
245, 561-564, 608-611, 717-720, 754-757, and 764-767. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 47 (SEQ ID NO:84), revealed significant 
homology between the PROl 120 amino acid sequence and the following Dayhoff sequences: CELK09C4_1, 
Gl^S_HUMAN,G65169,NCU89492_l,BCU44852_l,E64903 t P_R5I355,STS_HUMAN l GA6^_HUMAN^ 
and IDSMOUSE. Clone DNA48606-1479 was deposited with the ATCC on July 1, 1998, and is assigned 
ATCC deposit no. 203040. 

EXAMPLE 22 : Isolation of cDNA clones Encoding Human PRQ537 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated as Incyte EST cluster no. 29605. This EST cluster 
sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST 
databases (e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, 
CA) to identify existing homologies. The homology search was performed using the computer program BLAST 
or BLAST2 (Altshul et al. t Methods in E nzvmologv 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA48350. 

In light of an observed sequence homology between the DNA48350 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. R63443, the Merck EST clone R63443 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this ins en encoded a full-length protein. 
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The sequence of this cDNA insert is shown in Figure 48 and is herein designated as DNA4914 1-1431. 

Clone DNA4914 1-1431 contains a single open reading frame with an apparent translational initiation 
siie at nucleotide positions 97-99 and ending at the stop codon at nucleotide positions 442^44 (Figure 48). The 
predicted polypeptide precursor is 115 amino acids long (Figure 49). The full-length PR0537 protein shown 
in Figure 49 has an estimated molecular weight of about 13,183 daltons and a pi of about 12. 13. Analysis of 
5 the full-length PR0537 sequence shown in Figure 49 (SEQ ID NO:95) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 31, a potential N-glycosylation site from about 
amino acid 44 to about amino acid 47, potential N-myristolation sites from about amino acid 3 to about amino 
acid 8 and from about amino acid 16 to about amino acid 21 and an amino acid block having homology to 
multicopper oxidase proteins from about amino acid 97 to about amino acid 105. Clone DNA4914M431 has 

10 been deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203003. 

An analysis of the Dayhoff database (version 35,45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 49 (SEQ ID NO:95), evidenced homology 
between the PR0537 amino acid sequence and the following Dayhoff sequences: A54523, CELF22H10_2, 
FKH4_MOUSE,OTXlJIUMAN,URBI_UST^ 

15 and HRP3_PLAFS. 



EXAMPLE 23 : Isolation of cDNA clones Encoding Human PRCmfi 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated herein as ss.clu2437.init. This EST cluster sequence was 
20 then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et ah, Methods in EnTymnlnpY QfiA A^an (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
25 assembled mto a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA48351. 

In light of an observed sequence homology between the DNA48351 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. Hi 1 129, the Merck EST clone HI 1129 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
) The sequence of this cDNA insert is shown in Figure 50 and is herein designated as DNA49142-1430. 

Clone DNA49142-1430 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 48-50 and ending at the stop codon at nucleotide positions 987-989 (Figure 50). The 
predicted polypeptide precursor is 313 amino acids long (Figure 51). The full-length PR0536 protein shown 
in Figure 51 has an estimated molecular weight of about 34,189 daltons and a pi of about 4.8. Analysis of the 
rull-Iength PR0536 sequence shown in Figure 51 (SEQ ID NO:97) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 25, a potential N-glycosylation site from about amino 
acid 45 to about amino acid 48 and an ammo acid sequence block having homology to sulfatase proteins from 
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aboui amino acid 16 to about amino acid 26. Clone DNA49 142- 1430 has been deposited with ATCC on June 
23, 1998 and is assigned ATCC deposit no. 203002. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 51 (SEQ ID NO:97), evidenced homology 
between the PR0536 amino acid sequence and the following Dayhoff sequences: APU46857_1, PK2 DICDI, 
5 H64743, F5I14J8, CEAM_ECOLI, GEN14267, H64965, TCU39815J , PSBJJDDOSI and P_R06980. 

EXAMPLE 24: Isolation of cDNA clones En coding Human PROS1S 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated herein as ss.clu 12694. init. This EST cluster sequence was 

10 then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzymolopv 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 

15 assembled into a consensus DNA sequence with the program -phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA48352. Two 
propietary Genentech EST sequences were employed in the assembly are are herein shown in Figures 54 and 
55. 

m light of an observed sequence homology between the DNA48352 consensus sequence and an EST 
20 sequence encompassed within the Merck EST clone no. H86994, the Merck EST clone H86994 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 52 and is herein designated as DNA49 143- 1429. 

Clone DNA49143-1429 contains a single open reading frame with an apparent translaiional initiation 
site at nucleotide positions 78-80 and ending at the stop codon at nucleotide positions 681-683 (Figure 52). The 
25 predicted polypeptide precursor is 201 amino acids long (Figure 53). The full-length PR0535 protein shown 
in Figure 53 has an estimated molecular weight of about 22, 1 80 daitons and a pi of about 9.68. Analysis of the 
full-length PR0535 sequence shown in Figure 53 (SEQ ID NO:99) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 25, a transmembrane domain from about amino acid 
155 to about amino acid 174, a potential N-glycosylation site from about amino acid 196 to about amino acid 
30 199 and FKBP-type peptidyl -prolyl cis-trans isomer signature sequences from about amino acid 62 to about 
amino acid 77, from about amino acid 87 to about amino acid 123 and from about amino acid 128 to about amino 
acid 141. Clone DNA49143-1429 has been deposited with ATCC on June 23, 1998 and is assigned ATCC 
deposit no. 203013. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST- sequence 
alignment analysis of the full-length sequence shown in Figure 53 (SEQ ID NO:99), evidenced homology 
between the PR0535 amino acid sequence and the following Dayhoff sequences: S71237, PR93551 , P R28980, 
S71238, FKB2_HUMAN, CELC05C8_1, S55383, S72485, CELC50F2_6 and S75I44. 
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EXAMPLE 25 : Isolation of cDNA clones Encoding Human PR07I8 

A cDNA sequence isolated in the amylase screen described in Example 2 (human fetal lung library) 
above is herein designated DNA43512 (see Figure 62; SEQ ID NO: 108). The DNA43512 sequence was then 
compared to a variety of expressed sequence lag (EST) databases which included public EST databases (e.g., 
GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to 
5 identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et ah, Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA45625. Proprietary 
10 Genentech EST sequences were employed in the assembly and are herein shown in Figures 58-61 . 

Based on the DNA45625 sequence, oligonucleotide probes were generated and used to screen a human 
fetal lung library (LIB25) prepared as described in paragraph 1 of Example 2 above. The cloning vector was 
pRKSB (pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et ah, Science . 
±51:1278-1280 (1991)), and the cDNA size cut was less than 2800 bp. 
15 PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5'-GGGTGGATGGTACTGCTGCATCC-3' (SEQ ID NO: 109) 
reverse PCR primer 5 -TGTTGTGCTGTGGG AAATC AG ATGTG-3 ' (SEQ ID NO: 110) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA45625 sequence 
which had the following nucleotide sequence: 
20 hybridization probe 

5 -GTGTCTGGAGGCTGTGGCCGTTTTGTTITCTTGGGCTAAA ATCGGG-3 ' (SEQ ID NO:lII) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0718 gene using the probe oligonucleotide and one of the PCR primers. 

25 A full length clone was identified that contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 36-38 and ending at the stop codon found at nucleotide positions 
607-609 (Figure 56; SEQ ID NO: 102). The predicted polypeptide precursor is 157 amino acids long, has a 
calculated molecular weight of approximately 17,400 daltons and an estimated pi of approximately 5.78. 
Analysis of the full-length PR0718 sequence shown in Figure 57 (SEQ ID NO: 103) evidences the presence of 

30 the following: a type II transmembrane domain from about amino acid 21 to about amino acid 40, and other 
transmembrane domains at about amino acid 58 to about amino acid 78, about amino acid 95 to about amino acid 
114, and about ammo acid 127 to about amino acid 147; a cell attachment sequence from about amino acid 79 
to about amino acid 81; and a potential N-glycosylation site from about amino acid 53 to about amino acid 56. 
Clone DNA49647-I398 has been deposited with ATCC on June 2, 1998 and is assigned ATCC deposit no. 
35 209919. 

Analysis of the amino acid sequence of the full-length PR071 8 polypeptide suggests that it possesses 
no significant sequence similarity to any known protein. However, an analysis of the Dayhoff database (version 
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35.45 SwissProt 35) evidenced some degree of homology between the PR0718 amino acid sequence and the 
following Dayhoff sequences: AF045606J, AF039906J, SPBC8D2_2, S63441. F64728, COX1TRYBB, 
F64375, E64173, RPYGJT_3, MTCY261_23. 

EXAMPLE 26 : isolation of cDNA clones Encoding Human PRQ872 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence designated herein as clul 20709. init. The clul 20709. init sequence was then compared a 
proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al.. Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA48254. 

In light of an observed sequence homology between the DNA48254 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3438068, the Incyte EST clone 3438068 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 63 and is the full-length DNA sequence for PR0872. 
Clone DNA49819-1439 was deposited with the ATCC on June 2, 1998, and is assigned ATCC deposit no. 
209931. 

T> e entire nucleotide sequence of DNA49819-1439 is shown in Figure 63 (SEQ ID NO: 112). Clone 
DNA49819-1439 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 14-16 and ending at the stop codon at nucleotide positions 1844-1846 (Figure 63). The predicted 
polypeptide precursor is 610 amino acids long (Figure 64). The full-length PR0872 protein shown in Figure 
64 has an estimated molecular weight of about 66,820 daltons and a pi of about 8.65. Analysis of the full-length 
PR0872 sequence shown in Figure 64 (SEQ ID NO: 1 13) evidences the presence of the following features: a 
signai peptide at amino acid 1 to about 18, putative transmembrane domains at about amino acids 70-87, 200-222 
and 568-588; sequence identity with bacterial -type phytoene dehydrogenase protein at about amino acids 71-105; 
sequence identity with a regulator of chromosome condensation (RCC1) signature 2 at about amino acids 201- 
211; leucine zipper patterns at about amino acids 214-235, 221-242, 228-249 and 364-385; a potential N- 
glycosylation site at about amino acids 271-274; and a glycosaminoglycan attachment site at about amino acids 
75-78. Analysis of the amino acid sequence of the full-length PR0872 polypeptide using the Dayhoff database 
(version 35.45 SwissProt 35) evidenced homology between the PR0872 amino acid sequence and the following 
Dayhoff sequences: PRCRTIJ, S75951, S74689, CELF37C4J, CRTIRHOCA, S76617, YNI2_METTL, 
MTV014J4, AOFB HUMAN, and MMU70429J. 

EXAMPLE 27: Isolation of cDNA clones Encoding Human PRO1063 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as ss.cluI19743.iniL The Incyte EST cluster sequence 
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ss. dull 9743. init sequence was then compared to a variety of expressed sequence tag (EST) databases which 
included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incytc 
Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzymology 266:460-480 ( 1 996)). Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green. 
University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein 
designated DNA48288. 

In light of an observed sequence homology between the DNA48288 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2783726, the Incyte EST clone 2783726 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 65 and is herein designated DNA49820-1427. 

The full length clone shown in Figure 65 contained a single open reading frame with an apparent 
transitional initiation site at nucleotide positions 90-92 and ending at the stop codon found at nucleotide positions 
99J-99D (Figure 65; SEQ ID NO:l 14). The predicted polypeptide precursor is 301 amino acids long, has a 
calculated molecular weight of approximately 33,530 daltons and an estimated pi of approximately 4.80. 
Analysis of the full-length PRO1063 sequence shown in Figure 66 (SEQ ID NO: 1 15) evidences the presence of 
the following: a signal peptide from about amino acid 1 to about amino acid 21 , potential N-gtycosylation sites 
from about amino acid 195 to about amino acid 198, from about amino acid 217 to about amino acid 220 and 
from about amino acid 272 to about amino acid 275, a glycosaminoglycan attachment site from about amino acid 
267 to about amino acid 270, a microbodies C-terminal targeting signal site from about amino acid 299 to about 
amino acid 301, a type II fibronectin collagen-binding domain homology sequence from about amino acid 127 
to about amino acid 168 and a fructose-bisphosphate aldolase class II protein homology sequence from about 
ammo acid 101 to about amino acid 1 18. Clone DNA49820-1427 has been deposited with the ATCC on June 
2, 1998 and is assigned ATCC deposit no. 209932. 

Analysis of the amino acid sequence of the full-length PRO1063 polypeptide suggests that it possesses 
sequence similarity to the human type IV coliagenase protein. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some degree of homology between the PRO1063 amino acid 
sequence and the following Dayhoff sequences, S68303, CFU68533J, P_P91139, RNU65656J, 
PA2R_RABIT, MMU56734J, FINCXENLA, A48925, P_R92778 and FA12HUMAN. 

EXAMPLE 28: Isolation of cDNA clones Encoding Human PRQ61Q 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated herein as 88434. This EST cluster sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
GenBank) and a proprietary EST DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to identify 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 
(Altshul et al.. Methods in Enzymolopy , 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
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of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle. 
Washington). 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 1656694, the Incyte EST clone 1656694 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 67 and is herein designated as DNA4982 1-1562. 

The full length clone shown in Figure 67 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 81-83 and ending at the stop codon found at nucleotide positions 
450-452 (Figure 67; SEQ ID NO: 116). The predicted polypeptide precursor (Figure 68, SEQ ID NO: 117) is 
123 amino acids long including a predicted signal peptide at about amino acids 1-20. PR0619 has a calculated 
molecular weight of approximately 13,710 daltons and an estimated pi of approximately 5.19. Clone 
DNA4982M562 was deposited with the ATCC on June 16, 1998 and is assigned ATCC deposit no. 209981. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
aligLmenL analysis of the full-length sequence shown in Figure 68 (SEQ ID NO: 117), revealed significant 
homology between the PR06I9 amino acid sequence and the following Dayhoff sequences: S35302, D87009 1 . 
HSU93494 1, HUMIGLAM5J, D86999_2, HUMIGLYMl l, HUMIGLYMKE I , A29491J, A29498J, 
and VPR2_MOUSE. 

E XAMPLE 29 : Isolation of cDNA clones Encoding Human PRQ943 

A consensus DNA sequence encoding PR0943 was assembled relative to other EST sequences using 
phrap as described in Example 1 above. This consensus sequence was then extended using repeated cycles of 
BLAST and phrap to extend the consensus sequence as far as possible using the sources of EST sequences 
discussed above. The extended consensus sequence is herein designated DNA36360. Based on the DNA36360 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR0943. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 , -CGAGATGACGCCGAGCCCCC-3 I (SEQ ID NO:120) 
reverse PCR primer 5 '-CGGTTCGACACGCGGCAGGTG-3 1 (SEQ ID NO:12I) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA36360 
sequence which had the following nucleotide sequence 
hybridization probe 

5 '-TGCTGCTCCTGCTGCCGCCGCTGCTGCTGGGGGCCTTCCCGCCGG-3 ' (SEQ ID NO: 122) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0943 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal brain tissue. 
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DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0943 (designated herein as DNA52 192- 1369 [Figure 69, SEQ ID NO: 1 18)) and the derived protein sequence 
for PR0943. 

The entire nucleotide sequence of DNA52192-1369 is shown in Figure 69 (SEQ ID NO:l 18). Clone 
DNA52 192- 1369 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
5 positions 150-152 and ending at the stop codon at nucleotide positions 1662-1664 (Figure 69). The predicted 
polypeptide precursor is 504 amino acids long (Figure 70). The full-length PR0943 protein shown in Figure 
70 has an estimated molecular weight of about 54,537 dalions and a pi of about 10.04. Analysis of the full- 
length PR0943 sequence shown in Figure 70 (SEQ ID NO: 1 19) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 17, a transmembrane domain from about amino acid 376 

10 to about amino acid 396, tyrosine kinase phosphorylation sites from about amino acid 212 to about amino acid 
219 and from about amino acid 329 to about amino acid 336, potential N-glycosylation sites from about amino 
acid 1 1 1 to about amino acid 1 14, from about amino acid 23 1 to about amino acid 234, from about amino acid 
255 to about amino acid 258 and from about amino acid 293 to about amino acid 296 and an immunoglobulin 
md MHC protein sequence homology block from about amino acid 219 to about amino acid 236. Clone 

15 DNA52I92-1369 has been deposited with ATCC on July 1, 1998 and is assigned ATCC deposit no. 203042. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 70 (SEQ ID NO: 119), evidenced significant 
homology between the PR0943 amino acid sequence and the following Dayhoff sequences: B49151, A39752, 
FGR1XENLA, S38579, RATHBFGFRBl, TVHU2F, FGR2MOUSE, CEK3_CHICK, P_R21080 and 

20 A2717IJ. 

EXAMPLE 30: Isolation of cDNA clones Encoding Human PRO MRS 

A consensus DNA sequence was assembled relative to other EST sequences using the program "phrap" 
as described in Example i above. This consensus sequence is designated herein as DNA45679. Based on the 
25 DNA45679 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that 
contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence 
for PRO 1188. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-CTGGTGCCTCAACAGGGAGCAG-3' (SEQ ID NO:125) 
reverse PCR primer 5'-CCATTGTGCAGGTCAGGTCACAG-3 ' (SEQ ID NO: 126) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA45679 sequence which had the following nucleotide sequence: 
hybridization pmhe 

5 -CTGG AGC AAGTGCTC AGCTGCCTGTGGTC AG ACTGGGGTC-3 ' (SEQ ID NO: 127) 
35 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 188 gene using the probe oligonucleotide and one of the PCR primers. RNA 
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for construction of the cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01188 (designated herein as DNA52598-1518 [Figure 71, SEQ ID NO:123J); and the derived protein 
sequence for PR01188. 

The entire coding sequence of PROl 1 88 is shown in Figure 71 (SEQ ID NO: 123). Clone DNA52598- 
5 1518 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
136-138 and an apparent stop codon at nucleotide positions 3688-3690. The predicted polypeptide precursor 
is 1184 amino acids long. The full-length PROl 188 protein shown in Figure 72 has an estimated molecular 
weight of about 132,582 Daltons and a pi of about 8.80. Additional features include: a signal peptide at about 
amino acids 1-31; an ATP/GTP binding site motif A (P-loop) at about amino acids 266-273; an aldehyde 
10 dehydrogenases cysteine active site at about amino acids 188-199; growth factor and cytokines receptors family 
signature 2 at about amino acids 153-159; and potential N-glycosylation sites at about amino acids 129-132, 132- 
135, 346-349, 42(W23, 550-553, 631-634, 1000-1003, and 1056-1059. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
aligummt uialysis of the mil-length sequence shown in Figure 72 (SEQ ID NO: 124), revealed significant 
homology between the PR01188 amino acid sequence and the following Dayhoff sequences: SSU83114J, 
S56015, CET21B6_4, CELT19D21, and TSP1MOUSE. 

Clone DNA52598-1518 has been deposited with ATCC and is assigned ATCC deposit no 203107. 
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EXAMPLE 31: Isolation of c DNA clones Fncoding Human PROl 133 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This sequence was extended using repeated cycles of phrap. The extended consensus 
sequence is designated herein DNA38102. Based on the DNA38102 consensus sequence, oligonucleotides were 
synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as 
probes to isolate a clone of the full-length coding sequence for PR01133. 
25 PCR primers (two forward and one reverse) were synthesized: 

fonvard PCfi primer 1 5'-TCGATTATGGACGAACATGGCAGC-3 ' (SEQ ID NO: 130); 
fo rward PCR primer2 5'-TTCTGAGATCCCTCATCCTC-3' (SEQ ID NO:131); and 
r everse primer 5 - AGGTTC AGGG ACAGCAAGTTTGGG-3 ' (SEQ ID NO:132). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
30 DNA38 102 sequence which had the following nucleotide sequence: 
hybridization prnfr g 

5"nTGCTGGACCTCGGCTACGGAATTGGCTTCCCTCTACGGACAGCTGGAT3' (SEQ ID NO: 133). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amp.if.ca.ion with a PCR primer pair identified above. A positive library was then used to 
.solate clones encoding the PRO 1 133 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney tissue. 
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DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01133 and the derived protein sequence for PRO 1133. 

The entire coding sequence of PROl 133 is shown m Figure 73 (SEQ ID NO: 128). Clone DNA53913- 
1490 contains a single open reading frame with an apparent translation^ initiation site at nucleotide positions 
266-268 and an apparent stop codon at nucleotide positions 1580-1582 of SEQ ID NO: 128. The predicted 
polypeptide precursor is 438 amino acids long. The signal peptide is at amino acids 1-18 of SEQ ID NO: 129. 
EGF-Jike domain cysteine pattern signatures start at 315 and 385 of SEQ ID NO: 129 as shown in Figure 74. 
Clone DNA53913-1490 has been deposited with ATCC and is assigned ATCC deposit no. 203162. The full- 
length PROl 133 protein shown in Figure 74 has an estimated molecular weight of about 49,260 daltons and a 
pi of about 6.15. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 74 (SEQ ID NO: 129), revealed some sequence 
identity between the PROl 133 amino acid sequence and the following Dayhoff sequences (data from the database 
incorporated herein): AF002717J, LMG INHUMAN, B54665, UNC6_CAEEL, LML1_CAEEL, 
~M \5_MOUSE. MMU88353J, LMA1_HUMAN, HSLN2C64J and AF005258 1. 

EXAMPLE 32: Isolation of c DNA clones Encoding Human PRQ7R4 

An initial DNA sequence (SEQ ID NO: 136), referred to herein as DNA4466 1 and shown in Figure 77, 
was identified using a yeast screen, in a human fetal lung cDNA library that preferentially represents the 5* ends 
of the primary cDNA clones. DNA44661 was then compared to ESTs from public databases (e.g., GenBank), 
and a proprietary EST database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA), using the computer 
program BLAST or BLAST2 [Altschu] ct ah, MejhpdoiEnzyjn^lp^ 266:460-480 < 1 996) J . TheESTswere 
then clustered and assembled into a consensus DNA sequence using the computer program "phrap- (Phil Green, 
University of Washington, Seattle, Washington). The consensus sequence obtained is designated herein as 
"DNA45463". Based on the DNA45463 consensus sequence, oligonucleotides were synthesized for use as 
probes to isolate a clone of the full-length coding sequence for PR0784 from a human fetal lung cDNA library. 

The full length DNA53978-1443 clone shown in Figure 75 contained a single open reading frame with 
an apparent translauonal initiation site at nucleotide positions 37-39 and ending at the stop codon found at 
nucleotide positions 821-823 (Figure 75; SEQ ID NO:134). The predicted polypeptide precursor (Figure 76, 
SEQ ID NO: 135) is 228 amino acids long. PR0784 has a calculated molecular weight of approximately 25,735 
Daltons and an estimated pi of approximately 5.45. PR0784 has the following features: a signal peptide at about 
amino acid I to about 15; transmembrane domains at about amino acids 68 to about 87 and at about 183 to about 
204; potential N-myristoylation sites at about amino acids 15-20,51-56. 66-60, 163-168, and 206-21 1; and an 
RNP-i protein RNA-binding region at about amino acids 108 to about 117. 

Clone DNA53978-1443 was deposited with ATCC on June 16, 1998, and is assigned ATCC deposit 
no. 209983. 

Based on a BLAST and FastA sequence alignment analysis (using the ALIGN computer program) of 
the full-length sequence, PR0784 shows amino acid sequence identity to the following proteins: RNU42209 1, 
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253:1278-1280 (1991)) in the unique Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0783 [herein designated as DNA53 996- 1442] (SEQ ID NO: 137) and the derived protein sequence for 
PR0783. 

The entire nucleotide sequence of DNA53996-I442 is shown in Figure 78 (SEQ ID NO: 137). Clone 
5 DNA53996- 1442 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 310-312 and ending at the stop codon at nucleotide positions 1777-1779 (Figure 78). The predicted 
polypeptide precursor is 489 amino acids long (Figure 79). The full-length PR0783 protein shown in Figure 
79 has an estimated molecular weight of about 55,219 daltons and a pi of about 8.47. Analysis of the full-length 
PR0783 sequence shown in Figure 79 (SEQ ID NO: 138) evidences the presence of the following features: 
10 transmembrane domains located at about amino acids 23-42, 67-89, 111-135, 154-176, 194-218, 296-319, 
348-370, 387-410 and 427-452; leucine zipper patterns located at about amino acids 263-283 and 39^420; a 
potential tyrosine kinase phosphorylation site at about amino acids 180-187; potential N-glycosylation sites at 
about amino acids 105 -108 and 121-124; potential cAMP- and a cGMP-dependem protein kinase 
phosphorylation site at about amino acids 288-291; and a region having sequence identity with bacterial 
15 rhodopsins retinal binding site protein at about amino acids 190-218. 

An analysis of the Dayhoff database (version 35.45 Swiss Prot 35) shows some sequence identity 
between the PR0783 amino acid sequence and the following Dayhoff sequences: YNC2_CAEEL. D64048, 
ATAC002332JF4P9.3, NY2R_SHEEP, and VSH_MUMPA. 

Clone DNA53996-1442 was deposited with the ATCC on June 2, 1998, and is assigned ATCC deposit 
20 no. 209921. 

EXAMPLE 34 : Isolation of cDNA Clones Encoding Human PRO820 

An expressed sequence tag (EST) DNA database (Merck AVash. U) was searched and an EST designated 
EST no. AA504080, Merck clone 825136, was identified (library 3 12, human B-cell tonsil). Homology searches 

25 revealed that this EST showed sequence identity with low affiniry immunoglobulin gamma Fc receptor II. DNA 
sequencing gave the full-length DNA sequence for PRO820 and the derived protein sequence for PRO820, 

The entire nucleotide sequence of DNA56041-1416 is shown in Figure 82 (SEQ ID NO: 145). Clone 
DNA56041-1416 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 115-117 and ending at the stop codon at nucleotide positions 487-489 (Figure 82). The predicted 

30 polypeptide precursor is 124 amino acids long (Figure 83). The full-length PRO820 protein shown in Figure 
83 has an estimated molecular weight of about 14,080 daltons and a pi of about 7.48. Clone DNA56041-1416 
has been deposited with ATCC. Regarding the sequence, it is understood that the deposited clone contains the 
correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Still analyzing the amino acid sequence of SEQ ID NO: 146, the putative signal peptide is at about amino 

35 acids 1-15 of SEQ ID NO: 146. Protein kinase C phosphorylation sites are at about amino acids 20-22 and 43-45 
of SEQ ID NO: 146. An N-myristoylation site is at about amino acids 89-94 of SEQ ID NO: 146. An 
immunoglobulin and major histocompatibility complex domain is at about amino acids 83-90 of SEQ ID NO: 1 46. 
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The corresponding nucleotides can be routinely determined given the sequences provided herein. 

EXAMPLE 35 : Isolation of cDNA Clones Encoding Human PRO1080 

A consensus DNA sequence was assembled relative to other EST sequences using phrap and was 
extended using repeated cycles of BLAST and phrap so as to extend the consensus sequence as far as possible 
5 using the sources of the EST sequences as described in Example 1 above. The consensus sequence is designated 
herein as DNA52640. An EST proprietary to Genentech was employed in the consensus assembly and is herein 
designated as DNA36527 (Figure 86; SEQ ID NO: 149). 

In light of an observed sequence homology between the DNA36527 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 526423, the Merck EST clone 526423 was purchased 
10 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 84 and is herein designated as DNA56047-1456. 

The entire nucleotide sequence of DNA56047-1456 is shown in Figure 84 (SEQ ID NO: 147). Clone 
DNA56047-1456 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 159-161 and ending at the stop codon at nucleotide positions 1233-1235 of SEQ ID NO: 147 (Figure 
15 84). The predicted polypeptide precursor is 358 amino acids long (Figure 85). The full-length PROI080 protein 
shown in Figure 85 has an estimated molecular weight of about 40,514 dalions and a pi of about 6.08. Clone 
DNA56047-1456 has been deposited with ATCC on June 9, 1998. It is understood that the deposited clone has 
the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

20 Also shown in Figure 85 are the approximate locations of the signal peptide, cell attachment site, Nt- 

DnaJ domain signature, region having sequence identity with Nt-DnaJ domain proteins, and N-glycosylation 
sites. The corresponding nucleic acids of these amino acid sequences and others provided herein can be 
routinely determined by the information provided herein. 

25 EXAMPLE 36 : Isolation of cDNA Clones Encoding Human PRO1079 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above, and is herein designated DNA52714. Based on information provided by the assembly, the 
clone for Merck EST no. H06898 was obtained and sequenced, thereby giving the nucleotide sequence 
designated herein as DNA56050-1455. The entire nucleotide sequence of DNA56050-1455 is shown in Figure 

30 87 (SEQ ID NO: 150). Clone DNA5605O-1455 contains a single open reading frame with an apparent 
translational initiation site at nucleotide positions 183-185 and ending at the stop codon at nucleotide positions 
861-863 (Figure 87). The predicted polypeptide precursor is 226 amino acids long (Figure 88). The full-length 
PRO 1079 protein shown in Figure 88 has an estimated molecular weight of about 24,61 1 Daltons and a pi of 
about 4.85. Analysis of the full-length PRO1079 sequence shown in Figure 88 (SEQ ID NO:3) evidences the 

35 presence of the following features: a signal peptide at about amino acid 1-29; potential N-myristoylation sites 
at about amino acids 10-15, and 51-56; homology to photosystem I psaG and psaK proteins at about amino acids 
2 to 20; and homology to prolyl endopeptidase family serine proteins ai about amino acids 150 to 163. 
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Analysis of the amino acid sequence of the full-length PRO 1079 polypeptide using the Dayhoff database 
(version 35.45 SwissProt 35) evidenced some sequence identity between the PRO1079 amino acid sequence and 
the following Dayhoff sequences: CEK10C3_4, MMU50734 1, D69503, AF051149_1, and VSMPCVMS. 

Clone UNQ536 (DNA56O50-1455) was deposited with the ATCC on June 22, 1998. and is assigned 
ATCC deposit no. 203011. 

5 

EXAMPLE 37 : Isolation of cDNA clones Encoding Human PRQ793 

A cDNA clone (DNA561 10-1437) encoding a native human PR0793 polypeptide was identified by a 
yeast screen, in a human skin rumor cDNA library that preferentially represents the 5' ends of the primary 
cDNA clones. The yeast screen employed identified a single EST clone designated herein as DNA50177 (Figure 
10 91 ; SEQ ID NO: 154). The DNA50177 sequence was then compared to various EST databases including public 
EST databases (e.g. , GenBank), and a proprietary EST database (LIFESEQ*. Incyte Pharmaceuticals, Palo Alto, 
CA) to identify homologous EST sequences. The comparison was performed using the computer program 
BLAST or BLAST2 [AJtschul et ah, Methods in Enzvmologv. 266:460-480 (1996)]. Those comparisons 
resulting in a BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were 
15 clustered and assembled into a consensus DNA sequence with the program "phfap" (Phil Green, University of 
Washington, Seattle, Washington). This consensus sequence is herein designated DNA50972, 

In light of an observed sequence homology between the DNA50972 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. N33874, the Merck EST clone N33874 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
20 The sequence of this cDNA insert is shown in Figure 89 and is herein designated as DNA561 10-1437. 

The full-length DNA561 10-1437 clone shown in Figure 89 contains a single open reading frame with 
an apparent translation^ initiation site at nucleotide positions 77-79 and ending at the stop codon at nucleotide 
positions 49M93 (Figure 89). The predicted polypeptide precursor is 138 amino acids long (Figure 90). The 
full-length PR0793 protein shown in Figure 90 has an estimated molecular weight of about 15,426 daltons and 
25 a pi of about 10.67. Analysis of the full-length PR0793 sequence shown in Figure 90 (SEQ ID NO: 153) 
evidences the presence of the following: transmembrane domains from about amino acid 12 to about amino acid 
30, from about amino acid 33 to about amino acid 52, from about amino acid 69 to about amino acid 89 and 
from about amino acid 93 to about amino acid 109, potential N-myristolation sites from about amino acid 1 1 to 
about amino acid 16, from about amino acid 51 to about amino acid 56 and from about amino acid 1 16 to about 
30 amino acid 121 and an amino acid sequence block having homology to an aminoacyl -transfer RNA synthetase 
class-II protein from about amino acid 49 to about amino acid 59. Clone DNA561 10-1437 has been deposited 
with ATCC on August 11, 1998 and is assigned ATCC deposit no. 203113. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 90 (SEQ ID NO: 153), evidenced certain 
35 homology between the PR0793 amino acid sequence and the following Dayhoff sequences: S47453, 
AF015193J2, MTEHGNS9 2, E64030, H69784, D64995, CD53_MOUSE, GEN8006, AE001138_7 and 
COX2STRPU. 
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EXAMPLE 3g: Isolation of cD NA Clones Encoding Human PROW! 6 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. The consensus sequence obtained is herein designated DNA53502. 

In light of an observed sequence homology between the DNA53502 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 38680, the Merck EST clone 38680 was purchased and 
5 the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 92. 

The entire nucleotide sequence of DNA561 13-1378 is shown in Figure 92 (SEQ ID NO: 155). Clone 
DNA561 13-1378 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 168-170 and ending at the stop codon at nucleotide positions 1302-1304 (Figure 92). The predicted 
10 polypeptide precursor is 378 amino acids long (Figure 93). The full-length PRO1016 protein shown in Figure 
93 has an estimated molecular weight of about 44,021 daitons and a pi of about 9.07. Clone DNA561 13-1378 
has been deposited with the ATCC. Regarding the sequence, it is understood thai the deposited clone contains 
the couect sequence, and the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO1016 polypeptide suggests that portions of 
15 it possess sequence identity with acyltransferase, thereby indicating that* PRO 101 6 may be a novel 
acyltransferase. 

Still analyzing the amino acid sequence of SEQ ID NO: 156, the putative signal peptide is atabout amino 
acids 1-18 of SEQ ID NO: 156. The transmembrane domain(s) are at about amino acids 332-352 and 305-330 
of SEQ ID NO: 156. The fructose-bisphosphate aldolase class-II protein homology sequence is at about amino 
20 acids 73-90 of SEQ ID NO: 156. The extradiol ring-cleavage dioxygenase protein is at about amino acids 252- 
275 of SEQ ID NO:156. The corresponding nucleotides can be routinely determined given the sequences 
provided herein. 

The specific Dayhoff database designation names of sequences to which PRO1016 has sequence identity 
with include the following: S52645, P_R59712, PR99249, P_R59713, BNAGPATRFJ, CELT05H4J5 and 
25 CELZK40J. 



EXAMPLE 39 : Isolation of cDNA Encoding Human PRO 10 13 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. The consensus DNA sequence was then extended using repeated cycles of BLAST and 
30 phrap to extend the consensus sequence as far as possible using the sources of EST sequences. 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 3107695, the Incyte EST clone 3107695 was purchased and the 
cDNA insen was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insen is shown in Figure 94 and is herein designated as DNA564 10-14 14. 

The entire nucleotide sequence of DNA56410-1414 is shown in Figure 94 (SEQ ID NO: 157). Clone 
DNA564I0-1414 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 17-19 and ending at the stop codon at nucleotide positions 1244-1246 (Figure 94). The predicted 
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polypeptide precursor is 409 amino acids long (Figure 95). The full-length PRO1013 protein shown in Figure 
95 has an estimated molecular weight of about 46,662 daJtons and a pi of about 7. 18. Clone DNA564 10-14 14 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Still analyzing the amino acid sequence of SEQ ID NO: 158, the putative signal peptide is at about amino 
5 acids 1-19 of SEQ ID NO: 158. N-glycosylation sites are at about amino acids 75-78 and 322-325 of SEQ ID 
NO: 158. An N-myristoylation site is at about amino acids 184-189 of SEQ ID NO: 158. A growth factor and 
cytokine receptor family domain is at about amino acids 134-149 of SEQ ID NO: 158. The corresponding 
nucleotides can be routinely determined given the sequences provided herein. 

Blast analysis showed some sequence identity with other proteins. Specifically, PRO 10 1 3 has some 
10 sequence identity with at least the Dayhoff sequences designated: D638771 ; MHU22019_1 , AE000730_10, and 
AF0I9079J. 

EXAMPLE 40 : Isolation of cDNA Clones Encoding Human PRQ937 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
15 in Example 1 above. That consensus sequence is herein designated DNA4965I. Based on the DNA49651 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR0937. 

PCR primers (forward and reverse) were synthesized: 
20 forward PCR primer 5'-CTCCGTGGTAAACCCCACAGCCC-3* (SEQ ID NO: 161); and 
reverse PCR primer 5 '-TCACATCGATGGGATCCATG ACCG-3 ' (SEQ ID NO: 162). 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA48651 sequence 
which had the following nucleotide sequence: 
hybridization probe 

25 5 T -GGTCTCGTGACTGTGAAGCCATGTTACAACTACTGCTCAAACATCATGAG-3' (SEQ ID NO: 163). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0937 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB227). 

30 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR0937 [herein designated as DNA56436-1448] (SEQ ID NO: 159) and the derived protein sequence for 
PR0937. 

The entire nucleotide sequence of DNA56436-1448 is shown in Figure 96 (SEQ ID NO: 159). It 
contains a single open reading frame having an apparent translational initiation site at nucleotide positions 499- 
35 501 and ending at the stop codon found at nucleotide positions 2167-2169 (Figure 96, SEQ ID NO: 159). The 
predicted polypeptide precursor is 556 amino acids long, has a calculated molecular weight of approximately 
62,4 12 daltons and an estimated pi of approximately 6.62. Analysis of the full-length PR0937 sequence shown 
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in Figure 97 (SEQ ID NO: 160) evidences the presence of the following features: signal peptide ai about amino 
acids 1-22; ATP/GTP-binding site motif A (P-loop) at about amino acids 515-523; a potential N-glycosylation 
site at about amino acids 514-517; and sites of glypican homology at about amino acids 54-74, 106-156, 238- 
279, 309-345, 423-459, and 468-505. 

Clone DNA56436-1448 has been deposited with ATCC on May 27, 1998, and is assigned ATCC 
deposit no. 209902. 

Analysis of the amino acid sequence of the full-length PR0937 polypeptide suggests that it possesses 
significant sequence similarity to glypican proteins, thereby indicating that PR0937 may be a novel glypican 
protein. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PR0937 amino acid sequence and the following Dayhoff sequences: 
GPCK_MOUSE, GPC2_RAT, GPC5_HUMAN, GPC3_HUMAN. PR30168, CEC03H12_2, GEN13820, 
HS119E23_1, HDAC DROME, and AF017637_1. 

EXAMPLE 41 : Isolation of cDNA clones Encoding Human PRQ842 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as lncyte EST cluster sequence no. 69572. This EST cluster 
sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST 
databases (e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alio, 
CA) to identify existing homologies. The homology search was performed using the computer program BLAST 
or BLAST2 (Altshul et al., Methods m Enzvmoloev 266:460-4 SO (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA54230. 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Merck EST clone no. AA477092, the Merck EST clone AA477092 was purchased and 
the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 98 and is herein designated as DNA56855-1447. 

The full length clone shown in Figure 98 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 153-155 and ending at the stop codon found at nucleotide 
positions 510-512 (Figure 98; SEQ ID NO: 164). The predicted polypeptide precursor (Figure 99, SEQ ID 
NO: 165) is 1 19 amino acids long. PR0842 has a calculated molecular weight of approximately 13,819 Daltons 
and an estimated pi of approximately 11.16. Other features of PR0842 include a signal peptide at about amino 
acids 1-22, a potential protein kinase C phosphorylation site at about amino acids 39-41 and two potential N- 
myristoylation sites at about amino acids 27-32 and about amino acids 46-51. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 98 (SEQ ID NO: 164), evidenced some homology 
berween the PR0842 amino acid sequence and the following Dayhoff sequences: CEZK131J1, PR80843, 
RAT5HT2X_1, S81882 1, A60912, MCU603I5_137MC137L, U93422J, p_P91996, U93462J, and 
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Clone DNA56855-1447 was deposited with the ATCC on June 23, 1998, and is assigned ATCC deposit 
no. 203004. 

EXAMPLE 42 isolation of cDNA clones Encodine Human PRQ839 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte L1FESEQ* database, designated Incyte EST Cluster No. 24479. This EST 
cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included 
public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ*, Incyte 
Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
the computer program BLAST or BLAST2 ( Altshul et al. , Methods in Enzymology 266:46(M80 ( 1 996)). Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green. 
Lniversity of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein 
designated DNA55709. 

In light of an observed sequence homology between the DNA55709 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 754525, the Merck EST clone 754525 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 100 and is herein designated as DNA56859-1445. 

The full length clone shown in Figure 100 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 2-4 and ending at the stop codon found at nucleotide positions 
263-265 (Figure 100; SEQ ID NO: 166). The predicted polypeptide precursor (Figure 101 , SEQ ID NO: 167) 
is 87 amino acids long. PR0839 has a calculated molecular weight of approximately 9,719 DaJtons and an 
estimated pi of approximately 4.67. Other features of PR0839 include a signal peptide at about amino acids 1- 
23, potential protein kinase C phosphorylation sites at about amino acids 37-39 and about amino acids 85-87, 
a potential casein kinase II phosphorylation site at about amino acids 37-40, sequence identity with ribonucleotide 
reductase large subunit protein at about amino acids 50-60, and sequence identity with eukaryotic RNA-binding 
region RNP-1 proteins at about amino acids 70-79. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 101 (SEQ ID NO: 167), evidenced some 
homology between the PR0839 amino acid sequence and the following Dayhoff sequences: CD14MOUSE, 
XPR6_YARLI, HS7I4385J. S49783, BBI9_RABIT, GVPH-HALME, AB003135J, P_R85453, 
LUU27081_2, and TP2B MOUSE. 

Clone DNA56859-1445 was deposited with the ATCC on June 23, 1998, and is assigned ATCC deposit 
no.209019. 
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EXAMPLE 43 : Isolation of cDNA Clones Encoding Human pRQJ I ffl 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence (Incyte EST cluster sequence no. 14732). The Incyte EST cluster sequence no. 
14732 sequence was then compared to a variety of expressed sequence tag (EST) databases which included public 
EST databases (e.g. , GcnBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo 
Alto, CA) to identify existing homologies. The homology search was performed using the computer program 
BLAST or BLAST2 (Altsfaul et al . , Methods in Enzvmologv 266:460-480 (1996)) . Those comparisons resulting 
in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA5571 1. 

In light of an observed sequence homology between the DNA55711 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T60981, the Merck EST clone T60981 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 102 and is herein designated DNA56860-1510. 

The full length clone shown in Figure 102 contained a single open reading frame with an apparent 
translations initiation site at nucleotide positions 78-80 and ending at the stop codon found at nucleotide positions 
909-911 (Figure 102; SEQ ID NO:168). The predicted polypeptide precursor is 277 amino acids long, has a 
calculated molecular weight of approximately 31,416 daltons and an estimated pi of approximately 8.88. 
Analysis of the full-length PROl 180 sequence shown in Figure 103 (SEQ ID NO: 169) evidences the presence 
of the following: a signal peptide from about amino acid 1 to about amino acid 23, a leucine zipper pattern 
sequence from about amino acid 10 to about amino acid 31 , and potential N-myristoiation sited from about amino 
acid 64 to about amino acid 69, from about amino acid 78 to about amino acid 83, from about amino acid 80 
to about amino acid 85, from about amino acid 91 to about amino acid 96 and from about amino acid 201 to 
about amino acid 206. Clone DNA56860-1510 has been deposited with the ATCC on June 9, 1998 and is 
assigned ATCC deposit no. 209952. 

Analysis of the amino acid sequence of the full-length PRO 1 1 80 polypeptide suggests that it possesses 
sequence similarity to the methyltransferase family of proteins. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some degree of homology between the PRO1180 amino acid 
sequence and the following Dayhoff sequences, MTCI6514, D69267, YH09YEAST, BIOC_SERMA, 
ATAC00448415T1D16.16, SHGCPIRJ8, SPBC3B9_4, AB009504J4, P_W17977 and A69952. 

EXAMPLE 44: Isolation of cDNA clones Encoding Human PROl 134 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 75 1 1 . This EST cluster sequence was then compared to 
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
a proprietary EST DNA database (Lifeseq* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (AJtshul 
et al., Methods in Enzymology 266:46(M80 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
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in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington. Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA55725. Two proprietary Genentech EST 
sequences were employed in the assembly and are shown in Figure 106 (SEQ ID NO: 172) and Figure 107 (SEQ 
ID NO: 173). 

in light of an observed sequence homology between the DNA55725 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H94897, the Merck EST clone H94897 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 106 and is herein designated as DNA56865-1491. 

Clone DNA56865-149I contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 153-155 and ending at the stop codon at nucleotide positions 1266-1268 (Figure 104). 
The predicted polypeptide precursor is 371 amino acids long (Figure 105). The full-length PROI 134 protein 
shown in Figure 105 has an estimated molecular weight of about 4 1 ,935 daltons and a pi of about 9.58. Analysis 
of the rull-length PROI 134 sequence shown in Figure 105 (SEQ ID NO: 171) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 23 , potential N-glycosylarion sites from 
about amino acid 103 to about amino acid 106, from about amino acid 249 to about amino acid 252 and from 
about amino acid 257 to about amino acid 260, and an amino acid block having homology to tyrosinase CuA- 
binding region proteins from about amino acid 280 to about amino acid 306. Clone DNA56865-1491 has been 
deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203022. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 105 (SEQ ID NO: 171), evidenced significant 
homology between the PR01134 amino acid sequence and the following Dayhoff sequences: F20P5_I8, 
AC002396J0, S47847, C64146, GSPA_BACSU, P_W10564, RFAI_ECOLI, Y258_HAEIN, RFAJSALTY 
and P_R32985. 

EXAMPLE 45 : Isolation of cDNA clones Encoding Human PRQ83Q 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the incytedatabase, designated 20251 . This EST cluster sequence was then compared to 
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
a proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
el 31 ■ • Methods in Ergymology 266:460-480 ( 1 996)) . Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA55733. 

In light of an observed sequence homology between the DNA55733 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H78534, the Merck EST clone H78534 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
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The sequence of this cDNA insert is shown in Figure 108 and is herein designated as DNA5 6866- 1342. 

Clone DNA56866-1342 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 154-156 and ending at the stop codon at nucleotide positions 415-417 (Figure 108). 
The predicted polypeptide precursor is 87 amino acids long (Figure 109). The full-length PRO830 protein shown 
in Figure 109 has an estimated molecular weight of about 9,272 daltons and a pi of about 9. 19. Analysis of the 
full-length PRO830 sequence shown in Figure 109 (SEQ ID NO: 175) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 33, potential N-myristoylation sites from about 
amino acid 2 to about amino acid 7 and from about amino acid 8 to about amino acid 13 and a thioredoxin family 
of proteins homology block from about amino acid 23 to about amino acid 39. Clone UNQ470 (DNA56866- 
1342) has been deposited with ATCC on June 22, 1998 and is assigned ATCC deposit no. 203023. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 109 (SEQ ID NO:175), evidenced significant 
homology between the PRO830 amino acid sequence and the following Dayhoff sequences: HSU88 154J, 
HSU88153_1. SAPKSGENEl. HPU31791_5, GGCNOT2J. CPU9142IJ, CHKESTPO09J, PQ0769, 
U9/553J/9 and B60095. 

EXAMPLE 46 : Isolation of cDNA clones Encoding Human PROl 1 15 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the LIFESEQ* database, designated Incyte EST cluster sequence no. 165008. This EST 
cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included 
public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ*, Incyte 
Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
the computer program BLAST or BLAST2 (Altshul et al. , Methods in Enzvmoloev 266:460-480 (1996)). Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, 
University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein 
designated DNA55726. 

In light of an observed sequence homology between the DNA55726 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. R75784. the Merck EST clone R75784 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 1 1 1 and is herein designated as DNA56868-1478. 

The full length clone shown in Figure 1 10 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 189-191 and ending at the stop codon found at nucleotide 
positions 1524-1526 (Figure 110; SEQ ID NO: 176). The predicted polypeptide precursor (Figure 111, SEQ 
ID NO: 177) is 445 amino acids long. PROl 1 15 has a calculated molecular weight of approximately 50,533 
Daltons and an estimated pi of approximately 8.26. Additional features include a signal peptide at about amino 
acids 1-20; potential N-glycosylation sites at about amino acids 204-207, 295-298, and 313-316; and putative 
transmembrane domains at about amino acids 35-54, 75-97, 126-146. 185-204, 333-350, and 353-371. 
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An analysis of the Dayboff database (version 35.45 SwissProt 35). using a WU-BLAST-2 sequence 
aJignmem anaJysis of the full-length sequence shown in Figure 111 (SEQ ID NO: 177). evidenced some amino 
acid sequence identity between the PROU15 amino acid sequence and the following Daytaoff sequences: 
AF053947J79, S73698, CEC47A10_4, CCOMTNDS5GJ , HS4LMP2AC_1, LMP2_EBV, PA24MOUSE, 
HCU33331J7, P-W05508, and AF002273 1. 
5 Clone DN A56868-1478 was deposited with the ATCC on June 23 , 1 998 and is assigned ATCC deposit 

no. 203024.. 

EXAMPLE 47 : Isolation of cDNA clones Encoding Human PRO 1277 

A consensus DNA sequence was assembled relative to other ESTs using repeated cycles of BLAST and 
10 the program "phrap* as described in Example 1 above. One or more of the ESTs from the assembly was 
derived from diseased coronary artery tissue. The consensus sequence obtained is designated herein as 
"DNA49434". 

In light of an observed sequence homology between the DNA49434 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3042605, the Incyte EST clone 3042605 was purchased 
15 and the cDNA insert was obtained and sequenced. It was found that this insert" encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 1 12 (SEQ ID NO: 178). 

Clone DNA56869-1545 contains a single open reading frame with an apparent translation^ initiation 
site at nucleotide positions 1 88-190, and an apparent stop codon at nucleotide positions 2222-2224 (Figure 1 12). 
The predicted polypeptide precursor is 678 amino acids long (Figure 1 13). The full-length PRO 1277 protein 
20 shown in Figure 113 has an estimated molecular weight of about 73,930 daltons and a pi of about 9.48. 
Additional features include a signal peptide at about amino acids 1-26; a transmembrane domain at about amino 
acids 181-200, and potential N-gJycosylation sites at about amino acids 390-393 and 520-523. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 113 (SEQ ID NO: 179), revealed significant 
25 homology between the PRO 1 277 amino acid sequence and Dayhoff sequence no AF012252J. Homology was 
also found between the PRO 1277 amino acid sequence and the following Dayhoff sequences: AF006740J, 
CA36_HUMAN, HSU1J, HUMCOL7A1XJ, CAI7_HUMAN, MMZ78163J, CAMA_CHICK, 
HSU69263J , YNX3CAEEL, and MMRNAM31 . 

Clone DNA56869-1545 has been deposited with ATCC and is assigned ATCC deposit no. 203161. 

30 

EXAMPLE 48: Isolation of cDNA Clones Encoding Human PROll^ 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA52767. Based on the DNA52767 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
35 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PROH35. 
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In order 10 screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with PCR primer pairs prepared based upon the DN A52767 sequence. A positive 
library was then used to isolate clones encoding the PRO 1 135 gene using the probe oligonucleotide and one of 
the PCR primers. RNA for construction of the cDN A libraries was isolated from human coronary artery smooth 
muscle tissue (LIB309). The cDNA libraries used to isolate the cDNA clones were constructed by standard 
5 methods using commercially available reagents such as those from lnvitrogen, San Diego, CA. The cDNA was 
primed with oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, 
sized appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such 
as pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., 
Science , 253:1278-1280 (1991)) in the unique Xhol and NotI sites. 
10 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR01135 [herein designated as DNA56870-1492] (SEQ ID NO: 180) and the derived protein sequence for 
PR01135. 

The entire nucleotide sequence of DNA56870-1492 is shown in Figure 1 14 (SEQ ID NO: 180). Clone 
DNA56870-1492 contains a single open reading frame with an apparent translational initiation site at nucleotide 

15 positions 62-64 and ending at the stop codon at nucleotide positions 1685-1687 (Figure 114). The predicted 
polypeptide precursor is 541 amino acids long (Figure 1 15). The full-length PROl 135 protein shown in Figure 
115 has an estimated molecular weight of about 60,335 daltons and a pi of about 5.26. Analysis of the full- 
length PROl 135 sequence shown in Figure 115 (SEQ ID NO: 181) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about aino acid 21, potential N-glycosylaiion sited from about amino 

20 acid 53 to about amino acid 56, from about amino acid 75 to about amino acid 78, from about amino acid 252 
to about amino acid 255 and from about amino acid 413 to about amino acid 416 and an amino acid block having 
homology to glycosyl hydrolase family 35 proteins from about amino acid 399 to about amino acid 414. Clone 
DNA56870-1492 has been deposited with ATCC on June 2, 1998 and is assigned ATCC deposit no. 209925. 
Analysis of the amino acid sequence of the full-length PRO 11 35 polypeptide suggests that it possesses 

25 significant sequence similarity to the alpha 1 ,2-mannosidase protein, thereby indicating that PRO 11 35 may be 
a novel mannosidase. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between the PROl 135 amino acid sequence and the following Dayhoff 
sequences, DMC86E4_5, D86967J, SPAC23A1_4, YH04_YEAST,B54408,SSMAN9MAN_1, CEZC4104, 
S61631 and MSU14190J. 

30 

EXAMPLE 49 : Isolation of cDNA Clones Encoding Human PROl 1 14 

A cDN A sequence isolated in the amylase screen described in Example 2 above was found, by the WU- 
BLAST-2 sequence alignment computer program, to have certain sequence identity to other known interferon 
receptors. This cDNA sequence is herein designated DNA48466 and is shown in Figure 1 18 (SEQ ID N0:184). 
35 Based on the sequence identity, probes were generated from the sequence of the DNA48466 molecule and used 
to screen a human breast carconoma library (LIB 1 35) prepared as described in paragraph 1 of Example 2 above. 
The cloning vector was pRKSB (pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes 
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ct ah, Science . 253:1278-1280 (1991)), and the cDNA size cut was less than 2800 bp. 

The oligonucleotide probes employed were as follows: 
forward PCR primer 5 ' -A GGCTTCGCTGCG ACTAG ACCTC-3 ' (SEQ ID NO:185) 
reverse PCR primer 5-CCAGGTCGGGTAAGGATGGTTGAG-3' (SEQ ID NO: 186) 
hybridization probe 

5 S'-TTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC-S' (SEQ ID NO:187) 
A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 250-252, and a stop signal at nucleotide positions 1 183-1185 
(Figure 116, SEQ ID NO: 182). The predicted polypeptide precursor is 31 1 amino acids long, has a calculated 
molecular weight of approximately 35,076 daltons and an estimated pi of approximately 5.04. Analysis of the 

1 0 full-length PRO 1 1 14 interferon receptor sequence shown in Figure 1 1 7 (SEQ ID NO: 1 83) evidences the presence 
of the following: a signal peptide from about amino acid 1 to about amino acid 29, a transmembrane domain 
from about amino acid 230 to about amino acid 255, potential N-glycosylation sites from about amino acid 40 
to about amino acid 43 and from about amino acid 134 to about amino acid 137, an amino acid sequence block 
having homology to tissue factor proteins from about amino acid 92 to about amino acid 1 19 and an amino acid 

15 sequence block having homology to integrin alpha chain proteins from about amino acid 232 to about amino acid 
262. Clone DNA57033-I403 has been deposited with ATCC on May 27, 1998 and is assigned ATCC deposit 
no. 209905. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a VVU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 1 17 (SEQ ID NO: 183), evidenced significant 
20 homology between the PRO 1114 interferon receptor amino acid sequence and the following Dayhoff sequences: 
G01418, INRl_MOUSE, P_R71035, INGS_HUMAN, A26595J. A26593J. 156215 and TF HUMAN 

EXAMPLE 50 : Isolation of cDNA Clones Encoding Human PRQ828 

A consensus DNA sequence was identified using the method described in Example 1 above. This 
25 consensus sequence is herein designated DNA35717. Based on the DNA357I7 consensus sequence, 
oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, 
and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0828. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-GCAGGACTTCTACGACTTCAAGGC-3' (SEQ ID NO: 190); and 
30 reverse PCR primer 5 *-AGTCTGGGCCAGGTACTTGAAGGC-3 ' (SEQ ID NO: 191). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA35717 
sequence which had the following nucleotide sequence: 
hybridization probe 

5 -CAACATCCGGGGCAAACTGGTGTCGCTGGAGAAGTACCGCGGATCGGTGT-3' (SEQ ID NO:192) 
35 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0828 gene using the probe oligonucleotide and one of the PCR primers. RNA 
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for construction of the cDNA libraries was isolated from human fetal lung tissue (LIB25). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0828 [herein designated as DNA57037-1444] (SEQ ID NO: 188) and the derived protein sequence for 
PR0828. 

The entire nucleotide sequence of DNA57037-1444 is shown in Figure 1 19 (SEQ ID NO: 1 88). Clone 
DNA57037-1444 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 34-36 and ending at the stop codon at nucleotide positions 595-597 (Figure 119). The predicted 
polypeptide precursor is 187 amino acids long (Figure 120). The full-length PR0828 protein shown in Figure 
120 has an estimated molecular weight of about 20,996 daltons and a pi of about 8.62. Analysis of the full- 
length PR0828 sequence shown in Figure 120 (SEQ ID NO: 189) evidences the presence of the following: a 
signal peptide at about amino acids 1-21; sequences identity to glutathione peroxidases signature 2 at about 
amino acids 82-89; sequence identity to glutathione peroxidases selenocysteine proteins at about amino acids 35- 
60, 63-100, 107-134, and 138-159. Clone DNA57037-1444 has been deposited with ATCC on May 27, 1998, 
and is assigned ATCC deposit no. 209903. 

analysis of the amino acid sequence of the full-length PR0828 polypeptide suggests that it possesses 
significant sequence similarity to glutathione peroxidases, thereby indicating that PR0828 may be a novel 
peroxidase enzyme. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced sequence identity between the PR0828 amino acid sequence and the following Dayhoff sequences: 
AF053311_l t CELT09A12_2, AC004151 __3, BTUE_ECOLI, CER05H103, P_P80918, PWU88907J, and 
P_W22308. 

EXAMPLE 51 : Isolation of cDNA clones Encoding Human PRO 1009 

A cDNA clone (DNA57 129-1413) encoding a native human PRO1009 polypeptide was identified by 
the use of a yeast screen, in a human SK-Lu-1 adenocarcinoma cell line cDNA library that preferentially 
represents the 5' ends of the primary cDNA clones. First SEQ ID NO: 195 (Figure 123) was identified, which 
was extended by alignments to other EST sequences to form a consensus sequence. Oligonucleotide probes 
based upon the consensus sequence were synthesized and used to screen the cDNA library which gave rise to 
the full-length DNA57129-1413 clone. 

The full length DNA57129-1413 clone shown in Figure 121 contained a single open reading frame with 
an apparent translational initiation site at nucleotide positions 41-43 and ending at the stop codon found at 
nucleotide positions 1886-1888 (Figure 121; SEQ ID NO: 193). The predicted polypeptide precursor (Figure 
122, SEQ ID NO:194) is 615 amino acids long. Figure 122 also shows the approximate locations of the signal 
sequence, transmembrane domains, myristoylation sites, a glycosylation site and an AMP-binding domain. 
PRO 1 009 has a calculated molecular weight of approximately 68,125 daltons and an estimated pi of 
approximately 7.82. Clone DNA57129-1413 has been deposited with ATCC and is assigned ATCC deposit no. 
209977. It is understood that the deposited clone has the actual and correct sequence and that the representations 
herein may have minor, normal sequencing errors. 
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Based on a WU-BLAST-2 sequence alignment analysis (using the ALIGN computer program) of the 
full-length sequence. PRO 1009 shows amino acid sequence identity to at least the following proteins which were 
designated in a Dayhoff database as follows: F69893, CEF28F8_2, BSY13917J7. BSY13917J7, D69187, 
D69649, XCRPFBJ, E64928, YDID_ECOLI, BNACSF8J and RPU75363_2. 

5 KX AMPLE 52 : Isolation of cDNA Clones Encoding Human PRO1007 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated as DNA40671. 

In light of an observed sequence homology between the DNA40671 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T70513, the Merck EST clone T70513 was purchased 
10 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 124. 

The enure nucleotide sequence of DNA5 7690- 1374 is shown in Figure 124 (SEQ ID NO:196). Clone 
DNA57690-1374 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 16-18 and ending at the stop codon at nucleotide positions 1054-1056 (Figure 124). The predicted 
15 polypeptide precursor is 346 amino acids long (Figure 125). The full-length PRO 1007 protein shown in Figure 
125 has an estimated molecular weight of about 35,971 daltons and a pi of about 8. 17. Clone DNA57690-1374 
has been deposited with the ATCC on June 9, 1998. Regarding the sequence, it is understood that the deposited 
clone contains the actual sequence, and the sequences provided herein are based on known sequencing 
techniques. The representative figures herein show the representative numbering. 
20 Analysis of the amino acid sequence of the full-length PRO 1007 polypeptide suggests that portions of 

it possess sequence identity to MAGPIAP, thereby indicating that PRO 1007 may be a novel member of the 
family to which MAGPIAP belongs. . 

Still analyzing the amino acid sequence of SEQ ID NO: 197, the putative signal peptide is at about amino 
acids 1 30 of SEQ ID NO: 197. The transmembrane domain is at amino acids 325-346 of SEQ ID NO: 197. N- 
25 glycosylation sites are at about amino acids 118-121, 129-132, 163-166, 176-179, 183-186 and 227-130 of SEQ 
ID NO: 197. Ly-6/u-Par domain protein homology is at about amino acids 17-36 and 209-222 of SEQ ID 
NO: 1 97. The corresponding nucleotides of the amino acids presented herein can be routinely determined given 
the sequences provided herein. 

30 EXAMPLE 53 : Isolation of cDNA clones Encoding Human PRO 1056 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated herein as 6425. This EST cluster sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
GenBank) and a proprietary EST DNA database (Lifeseq*, Incyte Pharmaceuticals, Palo Alto, CA) to identify 

35 existing homologies. The homology search was performed using the computer program BLAST or BLAST2 
(Altshul et al.. Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
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consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA55736. 

In light of an observed sequence homology between the DNA55736 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. R88049, the Merck EST clone R88049 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
5 The sequence of this cDNA insert is shown in Figure 126 and is herein designated as DNA57693-1424. 

Clone DNA57693-1424 contains a single open reading frame with an apparent translation^ initiation 
site at nucleotide positions 56-58 and ending at the stop codon at nucleotide positions 416-41 8 (Figure 126), The 
predicted polypeptide precursor is 120 amino acids long (Figure 127). The full-length PRO 1056 protein shown 
in Figure 127 has an estimated molecular weight of about 13,345 daltons and a pi of about 5.18. Analysis of 

10 the full-length PRO1056 sequence shown in Figure 127 (SEQ ID NO: 199) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 18, a transmembrane domain from about 
amino acid 39 to about amino acid 58, a potential N-glycosylation site from about amino acid 86 to about amino 
acid 89, protein kinase C phosphorylation sites from about amino acid 36 to about amino acid 38 and from about 
amino acid 58 to about amino acid 60, a tyrosine kinase phosphorylation site from about amino acid 25 to about 

15 amino acid 32 and an amino acid sequence block having homology to channel Forming colicin proteins from 
about amino acid 24 to about amino acid 56. Clone DNA57693-1424 has been deposited with ATCC on June 
23, 1998 and is assigned ATCC deposit no. 203008. 

An analysis of the Dayhoff database (version 35.45 SwissProi 35). using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 127 (SEQ ID NO: 199), evidenced significant 

20 homology between the PRO1056 amino acid sequence and the following Dayhoff sequences: PLMHUMAN, 
A40533, ATNG HUMAN, A55571, ATNGSHEEP, S31524, GEN13025, RIC_MOUSE, A48678 and 
A10871J. 

EXAMPLE 54 : Isolation of cDNA clones Encoding Human PRQ826 

25 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 47283. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

30 et ah, Methods in Enzvmology 266:460480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56000. 

In light of an observed sequence homology between the DNA56000 consensus sequence and an EST 

35 sequence encompassed within the Merck EST clone no. W69233, the Merck EST clone W69233 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 128 and is herein designated as DNA57694-1341. 
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Clone DNA57694-1341 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 13-15 and ending at the stop codon at nucleotide positions 310-3 12 (Figure 128). The 
predicted polypeptide precursor is 99 amino acids long (Figure 129). The full-length PR0826 protein shown 
in Figure 129 has an estimated molecular weight of about 1 1,050 daltons and a pi of about 7.47. Analysis of 
the full-length PR0826 sequence shown in Figure 129 (SEQ ID NO:20l) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 22, potential N-myristoylation sites from 
about amino acid 22 to about amino acid 27 and from about amino acid 90 to about amino acid 95 and an amino 
acid sequence block having homology to peroxidase from about amino acid 16 to about amino acid 48. Clone 
DNA57694-1341 has been deposited with ATCC on June 22, 1998 and is assigned ATCC deposit no. 203017. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 129 (SEQ ID NO:201), evidenced significant 
homology between the PR0826 amino acid sequence and the following Dayhoff sequences: CCU12315_1, 
SCU96108_6, CELF39F10_4 and HELTHELHO. 

fcX AMPLE 55 : isolation of cDNA clones Encoding Human PROS 19 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 49605. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al. ( Methods in Enzvmoloev 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56015. 

In light of an observed sequence homology between the DNA560I5 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H65785, the Merck EST clone H65785 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 130 and is herein designated as DNA57695-1340. 

Clone DNA57695-I340 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 46-48 and ending at the stop codon at nucleotide positions 202-204 (Figure 130). The 
predicted polypeptide precursor is 52 amino acids long (Figure 131). The full-length PR0819 protein shown 
in Figure 131 has an estimated molecular weight of about 5 T 216 daltons and a pi of about 4.67. Analysis of the 
full-length PR0819 sequence shown in Figure 131 (SEQ ID NO:203) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 24, a potential N-myristoytation site from about 
amino acid 2 to about amino acid 7 and a region having homology to immunoglobulin light chain from about 
amino acid 5 to about amino acid 33. Clone DNA57695-1340 has been deposited with ATCC on June 23, 1998 
and is assigned ATCC deposit no. 203006. 
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An analysis of the Dayhoff database (version 35.45 SwissProi 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 131 (SEQ ID NO:203), evidenced significant 
homology between the PR0819 amino acid sequence and the following Dayhoff sequences: HSU03899 1, 
HUMIGLITEBJ, VG28_HSVSA, AF031522 1, PAD 1 YEA ST and AF045484 L 

EXAMPLE 56 : Isolation of cDNA Clones Encoding Human PRO 1006 

An initial candidate sequence from Incyte cluster sequence no. 45748 was identified using the signal 
algorithm process described in Example 3 above. This sequence was then aligned with a variety of public and 
Incyte EST sequences and a consensus sequence designated herein as DNA56036 was derived therefrom. 

In light of an observed sequence homology between the DNA56036 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 489737, the Merck EST clone 489737 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 132. 

The entire nucleotide sequence of DNA57699-1412 is shown in Figure 132 (SEQ ID NO:204). Clone 
DNA57699-1412 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 28-30 and ending at the stop codon at nucleotide positions 1204-1206 "(Figure 132). The predicted 
polypeptide precursor is 392 amino acids long (Figure 133). The full-length PRO 1006 protein shown in Figure 
133 has an estimated molecular weight of about 46,189 daltons and a pi of about 9.04. Clone DNA57699-1412 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO: 205, the putative signal peptide is at about amino 
acids 1-23 of SEQ ID NO:205. The N-glycosylation sites are at about amino acids 40-43, 53-56, 204-207 and 
373-376 of SEQ ID NO:205. An N-myristoylation site is at about amino acids 273-278 of SEQ ID NO:205. 

The corresponding nucleotides of these amino acid regions and others can be routinely determined given the 
sequences provided herein. 

EXAMPLE 57 : Isolation of cDNA Clones Encoding Human PROl 1 12 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a specific 
EST cluster sequence. This EST cluster sequence was then compared to a variety of expressed sequence tag 
(EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database 
(LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was 
performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmology 266:460- 
480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not 
encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" 
(Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is 
herein designated DNA56018. 

In light of an observed sequence homology between the DNA56018 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA223546, the Merck EST clone AA223546 was 
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purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 
protein. The sequence of this cDNA insert is shown in Figure 134 and is herein designated as DNA57702-1476. 

The entire nucleotide sequence of DNA57702-1476 is shown in Figure 134 (SEQ ID NO:206). Clone 
DNA57702-1476 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 20-22 and ending at the stop codon at nucleotide positions 806-808 of SEQ ID NO:206 (Figure 134). 
The predicted polypeptide precursor is 262 amino acids long (Figure 135). The full-length PROI112 protein 
shown in Figure 135 has an estimated molecular weight of about 29,379 daltons and a pi of about 8.93. Figure 
135 also shows the approximate locations of the signal peptide and transmembrane domains. Clone DNA57702- 
1476 has been deposited with the ATCC on June 9, 1998. It is understood that the deposited clone has the actual 
nucleic acid sequence and that the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PROl 1 12 polypeptide suggests that it possesses 
some sequence similarity to other proteins. More specifically, an analysis of the Dayhoff database (version 
35.45 SwissProt 35) evidenced some sequence identity between the PROl 1 12 amino acid sequence and at least 
the following Dayhoff sequences, MTY20B1113 (a mycobacterium tuberculosis peptide), F64471, 
AE000690_6, XLU16364_1, E43259 (H+ -transporting ATP synthase) and PIGSLADRXEJ (MHC class II 
histocompatibility antigen). 

EXAMPLE 58 : Isolation of cDNA clones Encoding Huma n PRO 1074 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence (Incyte cluster sequence No. 42586). This cluster sequence was then compared to 
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, Univ. of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56251. 

In light of an observed sequence homology between the DNA56251 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA081912, the Merck EST clone AA0819I2 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 
protein. The sequence of this cDNA insert is shown in Figure 136 and is the full-length DNA sequence for 
PROI074. Clone DNA57704-1452 was deposited with the ATCC on June 9, 1998, and is assigned ATCC 
deposit no. 209953. 

The entire nucleotide sequence of DNA57704-1452 is shown in Figure 136 (SEQ ID NO:208). Clone 
DNA57704-1452 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 322-324 and ending at the stop codon at nucleotide positions 1315-1317 (Figure 136). The predicted 
polypeptide precursor is 331 amino acids long (Figure 137). The full-length PROI074 protein shown in Figure 
137 has an estimated molecular weight of about 39,512 Daltons and a pi of about 8.03. Analysis of the full- 
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length PRO1074 sequence shown in Figure 137 (SEQ ID NO:209) evidences the presence of the following 
fearures: a transmembrane domain at about amino acids 20 to 39; potential N -glycosylate sites at about amino 
acids 72 to 75, 154 to 157, 198 to 201, 212 to 215, and 326 to 329; a glycosaminoglycan attachment site at about 
amino acids 239 to 242, and a Ly-6/u-PAR domain at about amino acids 23 to 36. 

Analysis of the amino acid sequence of the full-length PRO1074 polypeptide suggests that it possesses 

5 significant sequence similarity to beta 1 ,3-galactosyltransferase, thereby indicating that PRO 1074 may be a novel 
member of the galactosyltransf erase family of proteins. Analysis of the amino acid sequence of the full-length 
PRO 1074 polypeptide using the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between 
the PRO1074 amino acid sequence and the following Dayhoff sequences: AF029792J, P_R57433, 
DMU41449J, AC000348J4, P_R47479, CET09F5J, CEF14B6_4. CET15D6_5, CEC54C8_4, and 

10 CEE03H4J0. 

Clone DNA57704-1452 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit 
no. 209953. 

EXAMPLE 59 : Isolation of cDNA clones Encoding Hum an PRO 1005 

15 use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the LIFESEQ* database, Incyte cluster sequence no. 49243. This EST cluster sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 

20 BLAST2 (Altshul el al. t Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56380. 

In light of an observed sequence homology between the DNA56380 consensus sequence and an EST 

25 sequence encompassed within the Merck EST clone no. AA256657, the Merck EST clone AA256657 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 
protein. The sequence of this cDNA insert is shown in Figure 138 and is herein designated as DNA57708-141I . 

The full length clone shown in Figure 138 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 30-32 and ending at the stop codon found at nucleotide positions 

30 585-587 (Figure 138; SEQ ID NO:210). The predicted polypeptide precursor (Figure 139, SEQ ID NO:211) 
is 185 amino acids long. PRO1005 has a calculated molecular weight of approximately 20,331 daltons and an 
estimated pi of approximately 5.85. Clone DNA57708-1411 was deposited with the ATCC June 23, 1998, and 
is assigned ATCC deposit no. 203021 . 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

35 alignment analysis of the full-length sequence shown in Figure 139 (SEQ ID NO:211), evidenced some 
homology between the PRO 1005 amino acid sequence and the following Dayhoff sequences: DDU07187_1, 
DDU87912J, CELD1007J4, A42239, DDU42597J, CYAG_DICDI, S50452, MRKCKLEPN, P-R41998, 
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and XYNA_RUMFL. 

EXAMPLE 60 : Isolation of cDNA clones E ncoding Human PRO1073 

An initial DNA sequence referred to herein as DNA55938 and shown in Figure 142 (SEQ ID N0:214) 
was identified using a yeast screen, in a human SK-Lu-1 adenocarcinoma cell line cDNA library that 
5 preferentially represents the 5 ' ends of the primary cDNA clones. DNA55938 was then compared to ESTs from 
public databases (e.g., GenBank), and a proprietary EST database (LIFESEQ* Incyte Pharmaceuticals, Palo 
Alto, CA), using the computer program BLAST or BLAST2 [ Altschul et al., Methods in EnzvmQlogY, 266:460- 
480 (1996)]. The ESTs were clustered and assembled into a consensus DNA sequence using the computer 
program "phrap" (Phil Green. University of Washington, Seattle, Washington). The consensus sequence 
1 0 obtained is designated herein as DNA564 11. 

In light of an observed sequence homology between the DNA5641 1 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H86027, the Merck EST clone H86027 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 140. 
15 The full length DNA57710-1451 clone shown in Figure 140 contained a single open reading frame with 

an apparent translational initiation site at nucleotide positions 345-347 and ending at the stop codon found at 
nucleotide positions 1242-1244 (Figure 140; SEQ ID NO:212). The predicted polypeptide precursor (Figure 
141, SEQ ID NO:213) is 299 amino acids long. PRO 1073 has a calculated molecular weight of approximately 
34,689 daltons and an estimated pi of approximately 11.49. The PRO1073 polypeptide has the following 
20 additional features: a signal peptide at about amino acids 1-31, sequence identity to bZIP transcription factor 
basic domain signature at about amino acids, a potential N-glycosylation site at about amino acids 2-5, and 
sequence identity with protamine PI proteins at about amino acids 158-183. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 141 (SEQ ID NO:213), revealed some sequence 
25 identity between the PRO1073 amino acid sequence and the following Dayhoff sequences: MMU37351_1, 
ATAC00250510T9J22.10, S59043, ENXNUPR1, B47328, SR55_DROME, S26650, SON_HUMAN, 
VIT2 CHICK, and XLC4SRPRTJ. 

Clone DNA57710-1451 was deposited with the ATCC on July 1, 1998 and is assigned ATCC deposit 
no. 203048. 

30 

EXAMPLE 61 isolation of cDNA clones Encoding Human PROI 152 

A cDNA clone (DNA5771 1-1501) encoding a native human PROI 152 polypeptide was identified by 
employing a yeast screen, in a human infant brain cDNA library that preferentially represents the 5' ends of the 
primary cDNA clones. Specifically, a yeast screen was employed to identify a cDNA designated herein as 
35 DNA55807 (SEQ ID NO:217; see Figure 145). 

In light of an observed sequence homology between the DNA55807 sequence and an EST sequence 
encompassed within the Merck EST clone no. R56756, the Merck EST clone R56756 was purchased and the 
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cDNA insert was obtained and sequenced. It was found thai this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 143. 

The full-length DNA5771 1-1501 clone shown in Figure 143 contains a single open reading frame with 
an apparent translational initiation site at nucleotide positions 58-60 and ending at the stop codon at nucleotide 
positions 1495-1497 (Figure 143). The predicted polypeptide precursor is 479 amino acids long (Figure 144). 
5 The full-length PROI 152 protein shown in Figure 144 has an estimated molecular weight of about 53,602 daltons 
and a pi of about 8.82. Analysis of the full-length PR01152 sequence shown in Figure 144 (SEQ ID NO:2 16) 
evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 28, 
transmembrane domains from about amino acid 133 to about amino acid 155, from about amino acid 168 to 
about amino acid 187, from about amino acid 229 to about amino acid 247, from about amino acid 264 to about 

10 amino acid 285, from about amino acid 309 to about amino acid 330, from about amino acid 371 to about amino 
acid 390 and from about amino acid 441 to about amino acid 464, potential N-glycosylation sites from about 
amino acid 34 to about amino acid 37 and from about amino acid 387 to about amino acid 390 and an amino acid 
sequence block having homology to a respiratory-chain NADH dehydrogenase subunit from about amino acid 
24i to aoout amino acid 287. Clone DNA5771 1-1501 has been deposited with ATCC on July 1, 1998 and is 

15 assigned ATCC deposit no. 203047. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 144 (SEQ ID NO:216), evidenced significant 
homology between the PROI 152 amino acid sequence and the following Dayhoff sequences: AF052239_1, 
SYNN9CGAJ, SFCYTB21, GEN12507, P_R11769, MTVQ25J09, C61168, S43171, P_P61689 and 

20 P_P61696. 

EXAMPLE 62 : Isolation of cDNA clones Encoding Human PROI 136 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 109142. This EST cluster sequence was then compared 

25 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (Lifeseq* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al. , Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

30 DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56039. 

In light of an observed sequence homology between the DNA56039 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. HSCINF011, the Merck EST clone HSC1NF01 1 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 

35 protein. The sequence of this cDNA insert is shown in Figure 146 and is herein designated as DNA57827-1493. 

Clone DNA57827-1493) contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 216-218 and ending at the stop codon at nucleotide positions 21 12-2114 (Figure 146). 
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The predicted polypeptide precursor is 632 amino acids long (Figure 147). The full-length PROH36 protein 
shown in Figure 147 has an estimated molecular weight of about 69,643 daltons and a pi of about 8.5 . Analysis 
of the full-length PR01136 sequence shown in Figure 147 (SEQ ID NO:219) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 15 and potential N-glycosylation sites 
from about amino acid 108 to about amino acid 11, from about amino acid 157 to about amino acid 160, from 
5 about amino acid 289 to about amino acid 292 and from about amino acid 384 to about amino acid 387. Clone 
DNA57827-1493 has been deposited with ATCC on July 1, 1998 and is assigned ATCC deposit no. 203045. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 147 (SEQ ID NO:219), evidenced significant 
homology between the PR01136 amino acid sequence and the following Dayhoff sequences: AF034746J, 
10 AF034745J, MMAF000168J9, HSMUPP1J, AFO60539J, SP97_RAT, 138757, MMU93309J, 
CEK01A6 4 and HSA224747J. 

EXAMPLE 63 : Isolation of cDNA clones Encoding Human PRQ813 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

15 Incyte EST cluster sequence (Incyte EST cluster sequence no. 45501. The Incyte EST cluster sequence no. 
45501 sequence was then compared to a variety of expressed sequence tag (EST) databases which included public 
EST databases (e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ ™, Incyte Pharmaceuticals, Palo 
Alto, CA) to identify existing homologies. The homology search was performed using the computer program 
BLAST or BLAST2 (Altshul et al. , Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting 

20 in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56400. 

In light of an observed sequence homology between the DNA56400 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T90592, the Merck EST clone T90592 was purchased 

25 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 148 and is herein designated DNA57834-1339. 

The full length clone shown in Figure 148 contained a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 109-111 and ending at the stop codon found at nucleotide 
positions 637-639 (Figure 149; SEQ ID NO:221). The predicted polypeptide precursor is 176 amino acids long, 

30 has a calculated molecular weight of approximately 19,616 daltons and an estimated pi of approximately 7.11. 
Analysis of the full-length PR0813 sequence shown in Figure 149 (SEQ ID NO:221) evidences the presence of 
the following: a signal peptide from about amino acid 1 to about amino acid 26 and potential N-myristoylation 
sites from about amino acid 48 to about amino acid 53, from about amino acid 153 to about amino acid 158, 
from about amino acid 156 to about amino acid 161 and from about amino acid 167 to about amino acid 172. 

35 Clone DNA57834- 1339 has been deposited with the ATCC on June 9, 1998 and is assigned ATCC deposit no. 
209954. 
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Analysis of the amino acid sequence of the full-length PR0813 polypeptide suggests that it possesses 
sequence similarity to the pulmonary surfactant- associated protein C. More specifically, an analysis of the 
Dayhoff database (version 35.45 SwissProt 35) evidenced some degree of homology between the PR0813 amino 
acid sequence and the following Dayhoff sequences ,PSPC_MUS VI, PP92071, G02964. P_R65489, P_P82977. 
P_R84555, S55542, MUSIGHAJJ and PHI 158. 

5 

EXAMPLE 64 : Isolation of cDNA Clones Encoding Human PRO809 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence. The Incyte EST cluster sequence was then compared to a variety of expressed 
sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary EST DNA 

10 database (LIFE5EQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology 
search was performed using the computer program BLAST or BLAST! (Altshul et al., Methods in Enzvmology 
266:460-480 (19%)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

15 obtained therefrom is herein designated DNA56418. 

In light of an observed sequence homology between the DNA56418 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H74302, the Merck EST done H74302 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 150 and is herein designated DNA57836-1338. 

20 The entire nucleotide sequence of DNA57836- 1338 is shown in Figure 150 (SEQ ID NO:222). Clone 

DNA57836-1338 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 63-65 and ending at the stop codon at nucleotide positions 858-860 of SEQ ID NO:222 (Figure 150). 
The predicted polypeptide precursor is 265 amino acids long (Figure 151). The full-length PRO809 protein 
shown in Figure 151 has an estimated molecular weight of about 29,061 daltons and a pi of about 9.18. Figure 

25 151 further shows the approximate positions of the signal peptide and N-glysosylation sites. The corresponding 
nucleotides can be determined by referencing Figure 150. Clone DNA57836-1338 has been deposited with 
ATCC on June 23, 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that 
the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO809 polypeptide suggests that it possesses 

30 some sequence similarity to the heparin sulfate proteoglycan and to endothelial cell adhesion molecule- 1 . More 
specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced sequence identity 
between the PR0809 amino acid sequence and the following Dayhoff sequences, PGBM_MOUSE, D82082_l 
and PW14158. 

35 EXAMPLE 65 : Isolation of cDNA Clones Encoding Human PR0791 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence. The Incyte EST cluster sequence was then compared to a variety of expressed 
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sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary EST DNA 
database (LIFESEQ™, Incyie Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology 
search was performed using the computer program BLAST or BLAST2 (Altshul et al . , Methods in Enzvmologv 
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
5 program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56429. 

In light of an observed sequence homology between the DNA56429 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 36367, the Merck EST clone 36367 was purchased and 
the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 

10 sequence of this cDNA insert is shown in Figure 152 and is herein designated DNA57838-1337. 

The entire nucleotide sequence of DNA57838-1337 is shown in Figure 152 (SEQ ID NO:224). Clone 
DNA57838-1337 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 9-11 and ending at the stop codon at nucleotide positions 747-749 of SEQ ID NO:224 (Figure 152). 
The predicted polypeptide precursor is 246 amino acids long (Figure 153). The full-length PR0791 protein 

15 shown in Figure 153 has an estimated molecular weight of about 27,368 daltons and a pi of about 7.45. Figure 
153 also shows the approximate locations of the signal peptide, the transmembrane domain, N-glycosylation 
sites and a region conserved in extracellular proteins. The corresponding nucleotides of one embodiment 
provided herein can be identified by referencing Figure 152. Clone DNA57838-1337 has been deposited with 
ATCC on June 23, 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that 

20 the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PR0791 polypeptide suggests that it has sequence 
similarity with MHC-I antigens, thereby indicating that PR0791 may be related to MHC-I antigens. More 
specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced some sequenc identity 
between the PR0791 amino acid sequence and the following Dayhoff sequences, AF034346 1 , MMQ1K5_1 and 

25 HFEHUMAN. 

EXAMPLE 66 : Isolation of cDNA clones Encoding Human PRO 1004 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence, Incyte cluster sequence No. 7368 1 . This EST cluster sequence was then compared 

30 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
to identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (AJtshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, Univ. of Washington, Seattle, 

35 Washington). The consensus sequence obtained therefrom is herein designated as DNA56516. 

In light of an observed sequence homology between the DNA56516 consensus sequence and an EST 
sequence encompassed withm the Merck EST clone no. H43837, the Merck EST clone H43837 was purchased 
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and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 154. 

The full length clone shown in Figure 154 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 1 19-121 and ending at the stop codon at nucleotide positions 
464-466 (Figure 154; SEQ ID NO:226). The predicted polypeptide precursor is 115 amino acids long (Figure 
5 155; SEQ ID NO:227). The full-length PRO1004 protein shown in Figure 155 has an estimated molecular 
weight of about 13,649 daltons and a pi of about 9.58. Analysis of the full-length PRO 1004 sequence shown 
in Figure 155 (SEQ ID NO:227) evidences the presence of the following features: a signal peptide at about amino 
acids 1-24, a microbodies C-terminal targeting signal at about amino acids 1 13-1 15, a potential N-glycosylation 
site at about amino acids 71-74, and a domain having sequence identity with dihydrofolate reductase proteins at 
10 about amino acids 22-48. 

Analysis of the amino acid sequence of the full-length PRO 1004 polypeptide using the Dayhoff database 
(version 35.45 SwissProt 35) evidenced homology between the PRO 1004 amino acid sequence and the following 
Dayhoff sequences: CELR02D3 7, LECI_MOUSE, AF006691J, SSZ97390J. SSZ97395_1, and 
SSZ97400 1. 

15 Clone DNA57844-1410 was deposited with the ATCC on June 23, 1998," and is assigned ATCC deposit 

no. 203010. 

EXAMPLE 67 : Isolation of cDNA clones Encoding Human PROl 1 1 1 

An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) 

20 was searched and an EST was identified which had homology to insulin-like growth factor binding protein. 

RNA for construction of cDN A libraries was isolated from human fetal brain. The cDNA libraries used 
to isolate the cDNA clones encoding human PROl 1 1 1 were constructed by standard methods using commercially 
available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT 
containing a Nod site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately 

25 by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or 
pRKD; pRK5B is a precursor of pRKSD that does not contain the Sfil site; see, Holmes et al.. Science . 
251:1278-1280 (1991)) in the unique Xhol and NotI. 

The human fetal brain cDNA libraries (prepared as described above), were screened by hybridization 
with a synthetic oligonucleotide probe based upon the Incyte EST sequence described above: 

30 5 ' -CC ACCACCTGGAGGTCCTGCAGTTGGGC AGG AACTCCATCCGGCAGATTG-3 ' (SEQ ID NO:251). 

An identified cDNA clone was sequenced in entirety. The entire nucleotide sequence of PROl 1 1 1 is 
shown in Figure 156 (SEQ ID NO:228). Clone DNA58721-1475 contains a single open reading frame with an 
apparent translational initiation site at nucleotide positions 57-59 and a stop codon at nucleotide positions 2016- 
2018 (Figure 156; SEQ ID NO:228). The predicted polypeptide precursor is 653 amino acids long (Figure 157). 

35 The transmembrane domains are at positions 21-40 (type II) and 528-548. Clone DNA58721-1475 has been 
deposited with ATCC and is assigned ATCC deposit no. 2031 10. The full-length PROl 1 1 1 protein shown in 
Figure 157 has an estimated molecular weight of about 72,717 daltons and a pi of about 6.99. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 157 (SEQ ID NO:229), revealed some sequence 
identity between the PROl 1 1 1 amino acid sequence and the following Dayhoff sequences: A58532, D86983_l , 
RNPLGPV l, PGS2_HUMAN, AF038I27J, ALS_MOUSE. GPV HUMAN, PGS2_BOVIN, ALS_PAPPA 
and 147020. 

5 

EXAMPLE 68 : Isolation of cDNA clones Encoding Human PRO 1344 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA33790. Based on the DNA33790 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
10 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01344. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 ' - AGGTTCGTGATGGAG AC AACCGCG -3 ' (SEQ ID NO:232) 
reverse PCR primer 5 '-TGTC AAGGACGC ACTGCCGTC ATG-3 ' (SEQ ID NO:233) 
15 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA33790 
sequence which had the following nucleotide sequence 
hybridization probe 

5 -TGGCCAGATCATCAAGCGTGTCTGTGGCAACGAGCGGCCAGCTCCTATCC-3' (SEQ ID NO:234) 
Li order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

20 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 1344 gene using the probe oligonucleotide and one of the PCR primers. RNA 
fo; construction of the cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROB44 (designated herein as DNA58723-1588 [Figure 158. SEQ ID NO:230)); and the derived protein 

25 sequence for PRO 1344. 

The entire nucleotide sequence of DNA5 8723- 1588 is shown in Figure 158 (SEQ ID NO:230). Clone 
DNA58723-1588 contains a single open reading frame with an apparent translations! initiation site at nucleotide 
positions 26-28 and ending at the stop codon at nucleotide positions 2186-2188 (Figure 158). The predicted 
polypeptide precursor is 720 amino acids long (Figure 159). The full-length PR01344 protein shown in Figure 

30 159 has an estimated molecular weight of about 80,199 daltons and a pi of about 7.77. Analysis of the full- 
length PRO 1344 sequence shown in Figure 159 (SEQ ID N0:231) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 23, an EGF-like domain cysteine protein signature 
sequence from about amino acid 260 to about amino acid 271, potential N-glycosylation siies from about amino 
acid 96 to about amino acid 99, from about amino acid 279 to about airiino acid 282, from about amino acid 316 

35 to about amino acid 319, from about amino acid 451 to about amino acid 454 and from about amino acid 614 
to about amino acid 617, an amino acid sequence block having homology to serine proteases, trypsin family from 
about amino acid 489 to about amino acid 505 and a CUB domain protein profile sequence from about amino 
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acid 150 to about amino acid 166. Clone DNA58723-1588 has been deposited with ATCC on August 18, 1998 
and is assigned ATCC deposit no. 203133. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 159 (SEQ ID NO:231), evidenced significant 
homology between the PR01344 amino acid sequence and the following Dayhoff sequences: S77063_l, 
5 CRAR_MOUSE, P_R74775, P_P90070, P_R09217, P_P70475, HSBMP16J and U50330J. 

EXAMPLE 69 ; Isolation of cDNA clones Encoding Human PROl 109 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA52642. The consensus DNA sequence 

1 0 was obtained by extending using repeated cycles of BLAST and phrap a previously obtained consensus sequence 
as far as possible using the sources of EST sequences discussed above. Based on the DNA52642 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 
of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO 11 09. 
PCR primers (forward and reverse) were synthesized: 

15 forward PCR primer 5 '-CCTTACCTCAGAGGCC AG AGC A A GC-V (SEQ ID NO:237) 
reverse PCR primer 5' -GAGCTTCATCCGTTCTGCGTTC ACC-3 ' (SEQ ID NO:238) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA52642 
sequence which had the following nucleotide sequence 
hybridization probe 

20 5 '-CAGGAATGTAAAGCTTTACAGAGGGTCGCCATCCTCGTTCCCC ACC-3 ' (SEQ ID NO:239) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 109 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human SK-Lu-1 adenocarcinoma cell tissue (LIB247). 

25 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PROl 109 (designated herein as DNA58737-1473 [Figure 160. SEQ ID NO:235]) and the derived protein 
sequence for PROl 109. 

The entire nucleotide sequence of DNA58737-1473 is shown in Figure 160 (SEQ ID NO:235). Clone 
DNA58737-1473 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 

30 positions 1 1 9-120 and ending at the stop codon at nucleotide positions 1 15 1- 1153 (Figure 160). The predicted 
polypeptide precursor is 344 amino acids long (Figure 161). The full-length PROl 109 protein shown in Figure 
161 has an estimated molecular weight of about 40,041 daltons and a pi of about 9.34. Analysis of the full- 
length PROl 109 sequence shown in Figure 161 (SEQ ID NO:236) evidences the presence of the following: a 
signal peptide from about amino acid I to about amino acid 27, potential N-glycosylation sites from about amino 

35 acid 4 to about amino acid 7, from about amino acid 220 to about amino acid 223 and from about amino acid 
335 to about amino acid 338 and an amino acid sequence block having homology to xylose isomerase proteins 
from about amino acid 191 to about amino acid 201. Clone DNA58737-1473 has been deposited with ATCC 
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on August 18, 1998 and is assigned ATCC deposit no. 203136. 

An analysis of the Dayhoff database (version 35.45 SwissProi 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 161 (SEQ ID NO:236), evidenced significant 
homology between the PRO1109 amino acid sequence and the following Dayhoff sequences: HSUDPGALl, 
HSUDPB14J, NALS_BOVIN,HSU10473_1, CEW02B121 1 , YNJ4_CAEEL, AE000738J1, CET24D1J, 
5 S48121 and CEGLY9_1. 

EXAMPLE 70 : Isolation of cDNA clones Encoding Human PRO 1383 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA53961. Based on the DNA53961 
10 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01383. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-CATTTCCTTACCCTGGACCCAGCTCC-3' (SEQ ID NO:242) 
15 reverse PCR primer 5 '-GAAAGGCCCAC AGCACATCTGGCAG-3 ' (SEQ ID NO:243) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA53961 
sequence which had the following nucleotide sequence 
hybridization probe 

5'-CCACGACCCGAGCAACTTCCTCAAGACCGACTTGTTTCTCTACAGC-3* (SEQ ID NO:244) 
20 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR01383 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal brain tissue. • 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
25 PR01383 (designated herein as DNA58743-1609 [Figure 162, SEQ ID NO: 240)) and the derived protein 
sequence for PRO 1 383. 

The entire nucleotide sequence of DNA58743-1609 is shown in Figure 162 (SEQ ID NO:240). Clone 
DNA58743-1609 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 122-124 and ending at the stop codon at nucleotide positions 1391-1393 (Figure 162). The predicted 

30 polypeptide precursor is 423 amino acids long (Figure 163). The full-length PRO 1383 protein shown in Figure 
163 has an estimated molecular weight of about 46,989 daltons and a pi of about 6.77. Analysis of the full- 
length PRO 1383 sequence shown in Figure 163 (SEQ ID NO:241) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 24, a transmembrane domain from about amino acid 
339 to about amino acid 362, and potential N-glycosylation sites from about amino acid 34 to about amino acid 

35 37, from about amino acid 58 to about amino acid 61 1 from about amino acid 142 to about amino acid 145, from 
about amino acid 197 to about amino acid 200, from about amino acid 300 to about amino acid 303 and from 
about amino acid 364 to about amino acid 367. Clone DNA58743-1609 has been deposited with ATCC on 
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August 25, 1998 and is assigned ATCC deposit no. 203154. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 163 (SEQ ID NO:241), evidenced significant 
homology between the PROI383 amino acid sequence and the following Dayhoff sequences: NMB_HUMAN, 
QNRCOTJA, PW38335, P1I5_CHICK, P_W38164, A45993J, MMU70209J, D83704J andP_W39176. 

EXAMPLE 71 ; Isolation of cDNA Clones Encoding Human PRO 1 003 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as 43055. This sequence was then compared to a variety of EST 
databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database 
(LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search 
was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
tiwt did nor encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated consenOl. 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 2849382, the Incyte EST clone 2849382 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 164. 

The entire nucleotide sequence of DNA58846- 1409 is shown in Figure 164 (SEQ ID NO:245). Clone 
DNA58846-1409 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 41-43 and ending at the stop codon at nucleotide positions 293-295 (Figure 164). The predicted 
polypeptide precursor is 84 amino acids long (Figure 165). The full-length PRO 1003 protein shown in Figure 
165 has an estimated molecular weight of about 9,408 daltons and a pi of about 9.28. Analysis of the full-length 
PRO 1003 sequence shown in Figure 165 (SEQ ID NO:246) evidences the presence of a signal peptide at amino 
acids 1 to about 24, and a cAMP- and cGMP-dependent protein kinase phosphorylation site at about amino acids 
58 to about 61 . Analysis of the amino acid sequence of the full-length PRO 1003 polypeptide using the Dayhoff 
database (version 35.45 SwissProt 35) evidenced homology between the PRO 1003 amino acid sequence and the 
following Dayhoff sequences: AOPCZA363_3, SRTX_ATREN, A48298, MHVJHMSJ, VGL2_CVMJH, 
DHDHTC2 2, CORT RAT, TAL6 HUMAN, P_W14123, and DVUFI_2. 

EXAMPLE 72 : Isolation of cDNA Clones Encoding Human PROl 108 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA53237. 

In light of an observed sequence homology between the DNA53237 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2379881, the Incyte EST clone 2379881 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
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The sequence of this cDNA insert is shown in Figure 166 and is herein designated DNA58848-1472. 

The entire nucleotide sequence of DNA58848-1472 is shown in Figure 166 (SEQ ID NO:247). Clone 
DNA58848-1472 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 77-79 and ending at the stop codon at nucleotide positions 1445-1447 (Figure 166). The predicted 
polypeptide precursor is 456 amino acids long (Figure 167). The full-length PROl 108 protein shown in Figure 
5 167 has an estimated molecular weight of about 52,071 daltons and a pi of about 9.46. Analysis of the full- 
length PRO 11 08 sequence shown in Figure 167 (SEQ ID NO:248) evidences the presence of the following: type 
II transmembrane domains from about amino acid 22 to about amino acid 42, from about amino acid 156 to 
about amino acid 176, from about amino acid 180 to about amino acid 199 and from about amino acid 369 to 
about amino acid 388, potential N-glycosyiaion sites from about amino acid 247 to about amino acid 250, from 

10 about amino acid 327 to about amino acid 330, from about amino acid 328 to about amino acid 331 and from 
about amino acid 362 to about amino acid 365 and an amino acid block having homology to ER lumen protein 
retaining receptor protein from about ainino acid 153 to about amino acid 190. Clone DNA58848-1472 has been 
deposited with ATCC on June 9, 1998 and is assigned ATCC deposit no. 209955. 

Analysis of the amino acid sequence of the full-length PROl 108 polypeptide suggests that it possesses 

15 significant sequence similarity to the LPAAT protein, thereby indicating that PRO1108 may be a novel LPAAT 
homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PRO 1108 amino acid sequence and the following Dayhoff sequences, 
AF015811J, CER07E3_2, YL35_CAEEL, S73863, CEF59F4_4, P_W06422, MMU41736J, MTV008_39, 
P_R99248 and Y67_BPT7. 

20 

EXAMPLE 73 : Isolation of cDNA Clones Encoding Human PROl 137 

The extracellular domain (ECD) sequences (including the secretion signal, if any) of from about 950 

known secreted proteins from the Swiss-Prot public protein database were used to search expressed sequence 

tag (EST) databases. The EST databases included public EST databases (e.g., GenBank) and a proprietary EST 
25 DNA database (UFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA). The search was performed using the 

computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)) as a 

comparison of the ECD protein sequences to a 6 frame translation of the EST sequence. Using this procedure, 

Incyte EST No. 3459449, also referred to herein as "DNA7I08", was identified as an EST having a BLAST 

score of 70 or greater that did not encode a known protein. 
30 A consensus DNA sequence was assembled relative to the DNA7108 sequence and other ESTs using 

repeated cycles of BLAST and the program "phrap" (Phil Green, Univ. of Washington, Seattle, WA). The 

consensus sequence obtained therefrom is referred to herein as DNA53952. 

In light of an observed sequence homology between the DNA53952 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3663102, the Incyte EST clone 3663102 was purchased 
35 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

The sequence of this cDNA insert is shown in Figure 168. 
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The entire nucleotide sequence of DNA58849-1494 is shown in Figure 168 (SEQ ID NO:249). Clone 
DNA58849-1494 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 77-79 and ending at the stop codon at nucleotide positions 797-799 (Figure 168). The predicted 
polypeptide precursor is 240 amino acids long (Figure 169). The full-length PRO 1 137 protein shown in Figure 
169 has an estimated molecular weight of about 26,064 daltons and a pi of about 8.65. Analysis of the full- 
5 length PROl 137 sequence shown in Figure 169 (SEQ ID NO:250) evidences the presence of a signal peptide 
at about amino acids 1 to 14 and a potential N-glycosylation site at about amino acids 101-105. 

Analysis of the amino acid sequence of the full-length PRO 11 37 polypeptide suggests that it possesses 
significant sequence similarity to ribosyltransferase thereby indicating that PROl 137 may be a novel member 
of the ribosyltransferase family of proteins. Analysis of the amino acid sequence of the full-length PROl 137 
10 polypeptide using the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between the PROl 137 
amino acid sequence and the following Dayhoff sequences: MMART5J, NARG_MOUSE, GEN11909, 
GEN13794, GENI4406, MMRNART62J, and P_R41876. 

EXAMPLE 74 : Isolation of cDNA clones Encoding Human PROl 138 

15 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

Incyte EST sequence, Incyte cluster sequence no. 165212. This cluster sequence was then compared to a variety 
of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a 
proprietary EST DNA database (UFESEQ™, Incyte Pharmaceuticals. Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

20 et al., Methods in Enzvmology 266:460480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated as DNA54224. The assembly included a proprietary 
Genentech EST designated herein as DNA49140 (Figure 172; SEQ ID NO:254). 

25 In light of an observed sequence homology between the DNA54224 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3836613, the Incyte EST clone 3836613 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 170 and is the full-length DNA sequence for PROl 138. 
Clone DNA58850-1495 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 

30 209956. 

The entire nucleotide sequence of DNA58850-1495 is shown in Figure 170 (SEQ ID NO:252). Clone 
DNA58850-1495 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 38-40 and ending at the stop codon at nucleotide positions 1043-1045 (Figure 170). The predicted 
polypeptide precursor is 335 amino acids long (Figure 171). The full-length PROl 138 protein shown in Figure 
35 171 has an estimated molecular weight of about 37,421 Daltons and a pi of about 6.36. Analysis of the full- 
length PROl 138 sequence shown in Figure 171 (SEQ ID NO:253) evidences the presence of the following 
features: a signal peptide at about amino acid 1 to about amino acid 22; a transmembrane domain at about amino 
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acids 224 to about 250; a leucine zipper pattern at about amino acids 229 to about 250; and potential N- 
glyC osylauon S1 tes at about anuno acids 98-101, 142-145. 148-151, 172-175. 176-179. 204-207, and 29! -295. 

Analysis of the amino acid sequence of the full-length PRO 1 138 polypeptide suggests that it possesses 
significant sequence similarity to the CD84, thereby indicating that PROl 138 may be a novel member of the Ig 
superfamily of polypeptides. More particularly, analysis of the amino acid sequence of the full-length PROl 138 
polypeptide using the Dayhoff database (version35.45 SwissProt 35) evidenced homology between the PROl 138 
amino acid sequence and the following Dayhoff sequences: HSU82988.1 . HUMLY9 J . P.R9763 1 . P_R97628. 
P R97629. P_R97630, CD48_RAT, CD2_HUMAN, P_P93996. and HUMBGPJ. 

Clone DNA58850-1495 was deposited with ATCC on June 9, 1998. and is assigned ATCC deposit no. 

209956. 
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F Y AMPLE 75 : Isolation of cDNA clones Encoding Human PRO 1054 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the lncyte database, designated 66212. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 

15 and a proprietary EST DNA database (LIFESEQ*. lncyte Pharmaceuticals. Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al.. yipThorfs in Enzvmologv 266:460-480 (19%)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "pbrap" (Phil Green, University of Washington, Seattle. Washington). The 

20 consensus sequence obtained therefrom is herein designated DNA55722. 

In light of an observed sequence homology between the DNA55722 consensus sequence and an EST 
sequence encompassed within the lncyte EST clone no. 319751, the lncyte EST clone 319751 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 173 and is herein designated as DNA58853-1423. 

25 Clone DNA58853-1423 contains a single open reading frame with an apparent translation^ initiation 

site at nucleotide positions 46-48 and ending at the stop codon at nucleotide positions 586-588 (Figure 173). The 
predicted polypeptide precursor is 180 amino acids long (Figure 174). The full-length PRO1054 protein shown 
in Figure 174 has an estimated molecular weight of about 20,638 daltons and a pi of about 5.0. Analysis of the 
full-length PRO1054 sequence shown in Figure 174 (SEQ ID NO:256) evidences the presence of the following: 

30 a signal peptide from about amino acid 1 to about amino acid 1 8, a leucine zipper pattern from about amino acid 
155 to about amino acid 176 and amino acid sequence blocks having homology to lipocalin proteins from about 
amino acid 27 to about amino acid 38 and from about amino acid 110 to about amino acid 120. Clone 
DNA58853-1423 has been deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203016. 
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

35 alignment analysis of the full-length sequence shown in Figure 174 (SEQ ID NO:256). evidenced significant 
homology between the PRO1054 amino acid sequence and the following Dayhoff sequences: MUP1 J40USE, 
MUP6_MOUSE ( MUP2_MOUSE, MUP8_MOUSE. MUP5_MOUSE, MUP4_MOUSE, S10124. 
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MUPMMOUSE, MUP RAT and ECU70823J. 

EXAMPLE 76 : Isolation of cDNA clones Encoding Human PRQ994 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 157555. This EST cluster sequence was then compared 
5 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et aL , Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
10 DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA55728. 

In light of an observed sequence homology between the DNA55728 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2860366, the Incyte EST clone 2860366 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
15 The sequence of this cDNA insert is shown in Figure 175 and is herein designated as DNA58855-I422. 

Clone DNA58855-1422 contains a single open reading frame with an apparent iranslational initiation 
site at nucleotide positions 3 1 -33 and ending at the stop codon at nucleotide positions 718-720 (Figure 175). The 
predicted polypeptide precursor is 229 amino acids long (Figure 176). The full-length PR0994 protein shown 
in Figure 176 has an estimated molecular weight of about 25.109 daltons and a pi of about 6.83. Analysis of 
20 the full-length PR0994 sequence shown in Figure 176 (SEQ ID NO:258) evidences the presence of the 
following: transmembrane domains from about amino acid 10 to about amino acid 31, from about amino acid 
50 to about amino acid 72, from about amino acid 87 to about amino acid 110 and from about amino acid 191 
to about amino acid 213, potential N-glycosylation sites from about amino acid 80 to about amino acid 83, from 
about amino acid 132 to about amino acid 135, from about amino acid 148 to about amino acid 151 and from 
25 about amino acid 163 to about amino acid 166 and an amino acid block having homology to TNFR/NGFR 
cysteine-rich region proteins from about amino acid 4 to about amino acid 1 1 . Clone DNA58855-1422 has been 
deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203018. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 176 (SEQ ID NO: 258), evidenced significant 
30 homology between the PR0994 amino acid sequence and the following Dayhoff sequences: AF027204_I, 
TAL6HUMAN, ILT4HUMAN, JC6205, MMU57570J, S40363, ETU56093J, S42858, P_R66849 and 
PR74751. 

EXAMPLE 77 : Isolation of cDNA clones Encoding Human PRQ812 
35 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 170079. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
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and a proprietary EST DNA database (Lifeseq* Incytc Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
tt ai. , Methods in Enzvmologv 266:460480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
5 consensus sequence obtained therefrom is herein designated as DNA55721 . 

in light of an observed sequence homology between the DNA55721 consensus sequence and an EST 
sequence encompassed within the Incytc EST clone no. 388964, the Incyte EST clone 388964 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 177 and is herein designated as DNA59205-1421 . 

10 Clone DNA59205-1421 contains a single open reading frame with an apparent translational initiation 

site at nucleotide positions 55-57 and ending at the stop codon at nucleotide positions 304-306 (Figure 177). The 
predicted polypeptide precursor is 83 amino acids long (Figure 178). The full-length PR0812 protein shown 
in Figure 178 has an estimated molecular weight of about 9,201 daltons and a pi of about 9.3. Analysis of the 
full-length PR0812 sequence shown in Figure 178 (SEQ ID NO:260) evidences the presence of the following: 

15 a signal peptide from about amino acid 1 to about amino acid 15, a cAMP- and cGMP-dependent protein kinase 
phosphorylation site from about amino acid 73 to about amino acid 76 and protein kinase C phosphorylation sites 
from about amino acid 70 to about amino acid 72 and from about amino acid 76 to about amino acid 78. Clone 
DNA59205-1421 has been deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203009. 
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

20 alignment analysis of the full-length sequence shown in Figure 178 (SEQ ID NO:260), evidenced significant 
homology berween the PR0812 amino acid sequence and the following Dayhoff sequences: P_W35802, 
P_W35803, PSCIRAT, S68231, GEN13917, PSC2 RAT, CC 1 OHUM AN, UTER_RABIT, AF008595_1 and 
A56413. 

25 EXAMPLE 78 : Isolation of cDNA clones Encoding Human PRO 1069 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence designated herein as 100727. This sequence was then compared to a proprietary EST 
DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 

30 Enzvmologv 266:460-480 ( 1 996)) . Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, Univ. of Washington, Seattle, Washington). The consensus sequence obtained 
therefrom is herein designated DNA56001. 

In light of an observed sequence homology berween the DNA56001 consensus sequence and an EST 

35 sequence encompassed within the Incyte EST clone no. 353388 1 , the Incyte EST clone 3533881 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 179 and is the full-length DNA sequence for PRO1069. 
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Clone DNA5921 1-1450 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 

209960. 

The entire nucleotide sequence of DN ASK! 11 -1450 is shown in Figure 179 (SEQ ID NO:261). Clone 
DNA5921 1-1450 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 197-199 and ending at the stop codon at nucleotide positions 464-466. The predicted polypeptide 

5 precursor is 89 amino acids long (Figure 180). The full-length PRO1069 protein shown in Figure 180 has an 
estimated molecular weight of about 9,433 daltons and a pi of about 8.21 . Analysis of the full-length PRO1069 
sequence shown in Figure 180 (SEQ ID NO:262) evidences the presence of the following features: a signal 
peptide sequence at amino acid i to about 16; a transmembrane domain at about amino acids 36 to about 59; 
potential N-myristoylation sites at about amino acids 4146, 45-50, and 84-89; and homology with extracellular 

10 proteins SCP/Tpx-l/Ag5/PR-l/Sc7 at about amino acids 54 to about 66. 

Analysis of the amino acid sequence of the full-length PRO1069 polypeptide suggests that it possesses 
significant sequence similarity to CH1F, thereby indicating that PRO1069 may be a member of the CHIF family 
of polypeptides. More particularly, analysis of the amino acid sequence of the full-length PRO1069 polypeptide 
using the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between the PRO1069 amino acid 

15 sequence and the following Dayhoff sequences: CHIF RAT, A5557L PLM_HUMAN, A40533, 
ATNG BOVIN, RIC_MOUSE, PETDSYNY3, VTB1_XENLA, A05OO9, and S75086. 

Clone DNA5921 1-1450 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit 
no. 209960. 

20 EXAMPLE 79 : Isolation of cDNA Clones E ncoding Human PROl 129 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
incyte EST cluster sequence designated herein as 98833. The Incyte EST cluster sequence no. 98833 sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) 

25 to identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al.. Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56038. 

30 In light of an observed sequence homology between the DNA56038 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 1335241 , the Incyte EST clone 1335241 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 181 and is herein designated DNA59213-1487. 

The full length clone shown in Figure 181 contained a single open reading frame with an apparent 

35 translational initiation site at nucleotide positions 42-44 and ending at the stop codon found at nucleotide positions 
1614-1616 (Figure 181; SEQ ID NO:263). The predicted polypeptide precursor is 524 amino acids long, has 
a calculated molecular weight of approximately 60,310 daltons and an estimated pi of approximately 7.46. 
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Analysis of the full-length PRO 1129 sequence shown in Figure 182 (SEQ ID NO:264) evidences the presence 
of the following: type II transmembrane domains from about amino acid 13 to about amino acid 32 and from 
about amino acid 77 to about amino acid 102, a cytochrome P-450 cysteine heme-iron ligand signature sequence 
from about amino acid 461 to about amino acid 470 and potential N-glycosylation sites from about amino acid 
112 to about amino acid 115 and from about amino acid 168 to about amino acid 171. Clone DNA59213-1487 
5 has been deposited with the ATCC on June 9, 1998 and is assigned ATCC deposit no. 209959. 

Analysis of the amino acid sequence of the full-length PROl 129 polypeptide suggests that it possesses 
sequence similarity to the cytochrome P-450 family of proteins. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some degree of homology between the PROl 129 amino acid 
sequence and the following Dayhoff sequences, AC004523_I, S45702, AF054821J and 153015. 

10 

EXAMPLE 80 : Isolation of cDNA clones Encoding Human PRO1068 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the LIFESEQ® database, designated Incyte cluster no. 141736. This EST cluster sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 

15 (e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. One or more of the ESTs was derived from a human mast cell line from a patient 
with mast cell leukemia. The homology search was performed using the computer program BLAST or BLAST2 
(Altsbul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 

20 consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA56094. 

In light of an observed sequence homology between the DNA56094 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 004974. the Incyte EST clone 004974 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

25 The sequence of this cDNA insert is shown in Figure 183 and is herein designated as DNA59214-1449 (SEQ 
ID NO:265). 

The full length clone shown in Figure 183 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 42-44 and ending at the stop codon found at nucleotide positions 
414-416 (Figure 183; SEQ ID NO:265). The predicted polypeptide precursor (Figure 184, SEQ ID NO:266) 

30 is 124 amino acids long. PRO1068 has a calculated molecular weight of approximately 14,284 daltons and an 
estimated pi of approximately 8.14. The PRO1068 polypeptide has the following additional features, as 
indicated in Figure 184: a signal peptide sequence at about amino acids 1-20. a urotensin II signature sequence 
at about amino acids 118-123, a cell attachment sequence at about amino acids 64-66, and a potential cAMP- 
and cGMP-dependent protein kinase phosphorylation site at about amino acids 112-1 15. 

35 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 184 (SEQ ID NO:266), revealed homology 
between the PRO 1068 amino acid sequence and the following Dayhoff sequences: HALBOPJ, MTV043J36, 
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150498, and P_R78445 

Clone DNA59214-1449 was deposited with the ATCC on July 1, 1998 and is assigned ATCC deposit 
no.203046. 

RX AMPLE 81 : Isolation of cDNA clones Encoding Human PRO 1066 
5 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

lncyte EST cluster sequence designated herein as 79066. The Incyte EST cluster sequence no. 79066 sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) 
to identify existing homologies. The homology search was performed using the computer program BLAST or 

10 BLAST2 (Altshul et al.. Methods in Enzvmologv 266:46XM80 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56121. 

in light of an observed sequence homology between the DNA56121 consensus sequence and an EST 

15 sequence encompassed within the incyte EST clone no. 1515315, the Incyte EST clone 1515315 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 185 and is herein designated DNA592I5-1425. 

The full length clone shown in Figure 185 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 176-178 and ending al the stop codon found at nucleotide 

20 positions 527-529 (Figure 1 85 ; SEQ ID NO:267). The predicted polypeptide precursor is 1 1 7 amino acids long, 
has a calculated molecular weight of approximately 12,91 1 daltons and an estimated pi of approximately 5.46. 
Analysis of the fall-length PRO1066 sequence shown in Figure 186 (SEQ ID NO:268) evidences the presence 
of the following: a signal peptide from about amino acid 1 to about amino acid 23, a cAMP- and cGMP- 
dependent protein kinase phosphorylation site from about amino acid 38 to about amino acid 41 and potential 

25 N-myristoylation sites from about amino acid 5 to about amino acid 10, from about amino acid 63 to about amino 
acid 68 and from about amino acid 83 to about amino acid 88. Clone UNQ524 (DNA59215-1425) has been 
deposited with the ATCC on June 9, 1998 and is assigned ATCC deposit no. 209961 . 

Analysis of the amino acid sequence of the full-length PRO 1066 polypeptide suggests that it does not 
possess significant sequence similarity to any known human protein. However, an analysis of the Dayhoff 

30 database (version 35.45 SwissProt 35) evidenced some degree of homology berween the PRO 1066 amino acid 
sequence and the following Dayhoff sequences, MOTI HUMAN, AF025667 1, MTCY19H9_8 and 
RABIGKCHJ. 



EXAMPLE 82 : Isolation of cDNA Clones Encoding Human PRO 11 84 
35 Use of the signal sequence algorithm described in Example 3 on ESTs from an Incyte database allowed 

identification a candidate sequence designated herein as DNA56375. This sequence was then compared to a 
variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
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a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzymologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phxap" (Phil Green, University of Washington, Seattle, Washington). The 
5 consensus sequence obtained therefrom is herein designated DNA56375. 

In light of an observed sequence homology between the DNA56375 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 1428374, the Incyte EST clone 1428374 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 187. 

10 The full length clone shown in Figure 187 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 106-108 and ending at the stop codon found at nucleotide 
positions 532-534 (Figure 187; SEQ ID NO:269). The predicted polypeptide precursor is 142 amino acids long, 
has a calculated molecular weight of approximately 15,690 daltons and an estimated pi of approximately 9.64. 
Analysis of the full-length PROl 184 sequence shown in Figure 188 (SEQ ID NO:270) evidences the presence 

15 of a signal peptide at about amino acids 1-38. Clone DNA59220-1514 has been deposited with the ATCC on 
June 9, 1998. It is understood that the deposited clone has the actual sequences and that representations are 
presented herein. 

Analysis of the amino acid sequence of the full-length PROl 184 polypeptide suggests that it possesses 
some sequence identity with a protein called TIM from Drosophila virilis, designated "DVTIMS02_r in the 
20 Dayhoff data base, (version 35.45 SwissProt 35). Other 

Dayhoff database (version 35.45 SwissProt 35) sequences having some degree of sequence identity with 
PROH84 include: WISl_SCHPO, F002186J, ATAC0O239124 and MSAIPRPJ. 

EXAMPLE 83 : Isolation of cDNA clones Encoding Human PRO 1360 

25 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

sequence from an Incyte database, designated DNA 10572. This EST sequence was then compared to a variety 
of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank, Merck/Wash. 
U.) and a proprietary EST DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to identify 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 

30 (Altshul et al . , Methods in Enzvmoiogv 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA57314. 

In light of an observed sequence homology between the DNA57314 consensus sequence and an EST 

35 sequence encompassed within the Merck EST clone no. AA406443, the Merck EST clone AA406443 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 
protein. The sequence of this cDNA insert is shown in Figure 189 and is herein designated as DNA59488-1603. 
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The full length clone shown in Figure 189 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 54-56 and ending at the stop codon found at nucleotide positions 
909-911 (Figure 189; SEQ ID NO:27l). The predicted polypeptide precursor (Figure 190, SEQ ID NO:272) 
is 285 amino acids long. PRO 1360 has a calculated molecular weight of approximately 31,433 dahons and an 
estimated pi of approximately 7.32. Clone DNA59488-1603 was deposited with the ATCC on August 25, 1998 
5 and is assigned ATCC deposit no. 203157. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 190 (SEQ ID NO:272), revealed sequence identity 
between the PRO 1360 amino acid sequence and the following Dayhoff sequences: UN51CAEEL, 
YD4B_SCHPO. AF000634J, GFOZYMMO, YE1J_SCHP0, D86566J, ZMGFOJ, S76976, 
10 PPSA_SYNY3, and CEF28B1_4. 

EXAMPLE 84 ; Isolation of cDNA clones Encoding Human PRO 1029 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the lncyte database, designated 18763. This EST cluster sequence was then compared 
15 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ* lncyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzvmology 266:460-480 (1996^. Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
20 DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA57854. 

In light of an observed sequence homology between the DNA57854 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T98880, the Merck EST clone T98880 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
25 The sequence of this cDNA insert is shown in Figure 191 and is herein designated as DNA59493-1420. 

Clone DNA59493-1420 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 39-4 1 and ending at the stop codon at nucleotide positions 297-299 (Figure 191). The 
predicted polypeptide precursor is 86 amino acids long (Figure 192). The full-length PRO 1029 protein shown 
in Figure 192 has an estimated molecular weight of about 9,548 dahons and a pi of about 8.52. Analysis of the 
30 full-length PRO1029 sequence shown in Figure 192 (SEQ ID NO:274) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 19, an amino acid block having homology to 
bacterial rhodopsins retinal binding site protein from about amino acid 50 to about amino acid 61 , a prenyl group 
binding site from about amino acid 83 to about amino acid 86 and a potential N-glycosylation site from about 
amino acid 45 to about amino acid 48. Clone DNA59493-1420 has been deposited with ATCC on July 1 , 1998 
35 and is assigned ATCC deposit no. 203050, 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 192 (SEQ ID NO:274), evidenced significant 
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homology between the PRO1029 amino acid sequence and the following Dayhoff sequences: S66088, 
AF031815J, MM4A6LJ, PSElS52a-l, SI 7699 and P_R63635. 

EXAMPLE 85 : Isolation of cDNA clones Encoding Human PROl 139 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
5 cluster sequence from the Incyte database, designated 4461 . This EST cluster sequence was then compared to 
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
a proprietary EST DNA database (LTFESEQ* Incyte Pharmaceuticals. Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et ai. t Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 

10 in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA57312. 

The DNA57312 consensus sequence included a 172 nucleotides long public EST (T62095, 
Merck/University of Washington public database). This EST clone, identified herein as a putative protein coding 

15 sequence, was purchased from Merck, and sequenced to provide the coding sequence of PROl 139 (Figure 193). 
As noted before, the deduced amino acid sequence of DNA59497- 1496 shows a significant sequence identity with 
the deduced amino acid sequence of HSOBRGRPI . The full-length protein (Figure 194) contains a putative 
signal peptide between amino acid residues 1 and about 28, and three putative transmembrane domains 
(approximate amino acid residues 33-52, 71-89, 98-120). 

20 

EXAMPLE 86 : Isolation of cDNA clones Encodins Human PRO 1309 

An expressed sequence tag (EST) DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) 
was searched and an EST was identified which showed homology to SLIT. 

RNA for construction of cDNA libraries was isolated from human fetall brain tissue. The cDNA 
25 libraries used to isolate the cDN A clones encoding human PRO 1 309 were constructed by standard methods using 
commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with 
oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized 
appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as 
pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see. Holmes et at. , Science . 
30 253:1278-1280 (1991)) in the unique Xhol and NotI. 

The cDNA libraries (prepared as described above), were screened by hybridization with a synthetic 
oligonucleotide probe derived from the above described Incyte EST sequence: 

5 -TCCGTGCAGGGGGACGCCTTTCAGAAACTGCGCCGAGTTAAGGAAC-3' (SEQ ID NO:279). 

A cDNA clone was isolated and sequenced in entirety. The entire nucleotide sequence of DNA59588- 
35 1571 is shown in Figure 195 (SEQ ID NO:277). Clone DNA59588-1571 contains a single open reading frame 
with an apparent translational initiation site at nucleotide positions 720-722 and a stop codon at nucleotide 
positions 2286-2288 (Figure 195; SEQ ID NO:277). The predicted polypeptide precursor is 522 amino acids 
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long. The signal peptide is approximately at 1-34 and the transmembrane domain is at approximately 428-450 
of SEQ ID N0:278. Clone DNA59588-1571 has been deposited with ATCC and is assigned ATCC deposit no. 
203106. The full-length PRO 1309 protein shown in Figure 196 has an estimated molecular weight of about 
58,614 daltons and a pi of about 7.42. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
5 alignment analysis of the full-length sequence shown in Figure 196 (SEQ ID NO:278), revealed sequence identity 
between the PRO 1309 amino acid sequence and the following Dayhoff sequences: AB007876_1 , GPV_MOUSE, 
ALS RAT, P_R85889, LUM_CHICK, ABO 14462 J, PGS1_CANFA, CEM88_7, A58532 and GEN11209. 

EXAMPLE 87 : isolation of cDNA Clones Encoding Human PRO 1028 

10 Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 

15 Enzvmology 266:460-480 ( 1 996)) . Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA59603. 

In light of an observed sequence homology between the DNA59603 sequence and an EST sequence 

20 contained within Incyte EST clone no. 1497725, the Incyte EST clone no. 1497725 was purchased and the cDNA 
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 
this cDNA insert is shown in Figure 197 and is herein designated as DNA59603-I419. 

The entire nucleotide sequence of DNA59603-1419 is shown in Figure 197 (SEQ ID NO:280). Clone 
DNA5 9603- 14 19 contains a single open reading frame with an apparent translational initiation site at nucleotide 

25 positions 21-23 and ending at the stop codon at nucleotide positions 612-614 (Figure 197). The predicted 
polypeptide precursor is 197 amino acids long (Figure 198). The full-length PRO 1028 protein shown in Figure 
198 has an estimated molecular weight of about 20,832 daltons and a pi of about 8.74. Clone DNA59603-1419 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

30 Analyzing the amino acid sequence of SEQ ID NO:281 , the putative signal peptide is at about amino 

acids M9ofSEQIDNO:281. An N-glycosylation site is at about amino acids 35-38 of SEQ ID NO :281. A 
C-rype lectin domain is at about amino acids 108-1 17 of SEQ ID NO:281 , indicating that PR0513 may be related 
to or be a lectin. The corresponding nucleotides of these amino acid sequences or others can be routinely 
determined given the sequences provided herein. 

35 
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EXAMPLE 88 : Isolation of cDNA Clones Encoding Human PRO 1027 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain 
EST cluster sequence from the lncyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ* lncyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods P* 
Enzymologx 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56399. 
10 In light of an observed sequence homology between the DNA56399 sequence and an EST sequence 

contained within lncyte EST clone no. 937605, the lncyte EST clone no. 937605 was purchased and the cDNA 
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 
this cDNA insert is shown in Figure 199 and is herein designated as DNA59605-1418. 

The entire nucleotide sequence of DNA59605-1418 is shown in Figure 199 (SEQ ID NO:282). Clone 
15 DNA59605-1418 contains a single open reading frame with an apparent translationaJ initiation site at nucleotide 
positions 31-33 and ending at the stop codon at nucleotide positions 262-264 (Figure 199). The predicted 
polypeptide precursor is 77 amino acids long (Figure 200). The full-length PRO 1027 protein shown in Figure 
200 has an estimated molecular weight of about 8,772 daltons and a pi of about 9.62. Clone DNA59605-1418 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
20 the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO: 283, the putative signal peptide is at about amino 
acids 1-33 of SEQ ID NO:283. The type II fibronectin collagen-binding domain begins at about amino acid 30 
of SEQ ID NO:283. The corresponding nucleotides for these amino acid sequences and others can be routinely 
determined given the sequences provided herein. PRO1027 may be involved in tissue formation or repair, 
25 The following Dayhoff designations appear to have some sequence identity with PRO 1027: 

SFT2YEAST; ATM3E9J2; A69826; YM16_MARPO; E64896; U60193_2; MTLRAJ205J; MCU60315_70; 
SPAS_SHIFL; and S54213. 

EXAMPLE 89 : Isolation of cDNA Clones Encoding Human PRO 11 07 

30 Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain 

EST cluster sequence from the lncyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ*, lncyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 

35 Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program *phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
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obtained therefrom is herein designated DNA56402. 

In light of an observed sequence homology between the DNA56402 sequence and an EST sequence 
contained within Incyte EST clone no. 3203694, the lncyte EST clone no. 3203694 was purchased and the cDN A 
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 
this cDNA insert is shown in Figure 201 and is herein designated as DNA59606-1471. 
5 The entire nucleotide sequence of DNA59606-1471 is shown in Figure 201 (SEQ ID NO:284). Clone 

DNA59606-147 1 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 244-246 and ending at the stop codon at nucleotide positions 1675-1677 of SEQ ID NO:284 (Figure 
201). The predicted polypeptide precursor is 477 amino acids long (Figure 202). The full-length PROl 107 
protein shown in Figure 202 has an estimated molecular weight of about 54,668 daltons and a pi of about 6.33. 
10 Clone DNA59606-1471 has been deposited with ATCC on June 9, 1998. It is understood that the deposited 
clone has the actual nucleic acid sequence and that the sequences provided herein are based on known 
sequencing techniques. 

Analysis of the amino acid sequence of the full-length PROl 107 polypeptide suggests that it possesses 
significant sequence similarity to phosphodiesterase I/nucleotide phyrophosphatase, human insulin receptor 
15 tyrosine kinase inhibitor, alkaline phosphodiesterase and autotaxin, thereby indicating that PROl 107 may have 
at least one or all of the activities of these proteins, and that PROl 107 is a novel phosphodiesterase. More 
specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced sequence identity 
between the PROl 107 amino acid sequence and at least the following Dayhoff sequences: AF005632_1, 
P_R79148, RNU78787J, AF060218_4, A57080 and HUMATXTJ. 

20 

EXAMPLE 90 : Isolation of cDNA clones Encoding Human PROl 140 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence, Incyte cluster sequence No. 135917. This sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 

25 EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et aL, Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, Univ. of Washington, Seattle, Washington). The consensus sequence obtained 

30 therefrom is herein designated DNA56416. 

In light of an observed sequence homology between DNA56416 and an EST sequence contained within 
Incyte EST clone no. 3345705, Incyte EST clone no. 3345705 was obtained and its insert sequenced. It was 
found that the insert encoded a full-length protein The sequence, designated herein as DNA59607-1497, which 
is shown in Figure 203, is the full-length DNA sequence for PROl 140. Clone DNA59607-1497 was deposited 

35 with the ATCC on June 9. 1998, and is assigned ATCC deposit no. 209946. 

The entire nucleotide sequence of DNA59607-1497 is shown in Figure 203 (SEQ ID NO:286). Clone 
DNA59607-1497 contains a single open reading frame with an apparent translational initiation site at nucleotide 
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positions 210-212 and ending at the stop codon at nucleotide positions 975-977 (Figure 203). The predicted 
polypeptide precursor is 255 amino acids long (Figure 204). The hill-length PROl 140 protein shown in Figure 
204 has an estimated molecular weight of about 29,405 daltons and a pi of about 7.64. Analysis of the full- 
length PROl 140 sequence shown in Figure 204 (SEQ ID NO:287) evidences the presence of three 
transmembrane domains at about amino acids 101 to 118, 141 to 161 and 172 to 191. 

Analysis of the amino acid sequence of the full-length PROl 140 polypeptide using the Dayhoff database 
(version 35.45 SwissProt 35) evidenced homology between the PROl 140 amino acid sequence and the following 
Dayhoff sequences: AF023602J, AF000368J, CIN3_RAT, AF003373J, GEN13279, and AF003372J. 

Clone DNA59607-1497 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit 
no. 209946. 



10 



FX AMPLE 91 : Isolation of cDNA clones Encoding Human P ROl 106 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence. This sequence was then compared to a variety of expressed sequence tag (EST) databases 
which included public EST databases (e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte 

15 Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
the computer program BLAST or BLAST2 ( Altshul et al. , Mnhod< in FnTvmologv 266:460-480 (1996)). Tnose 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green. 
Univ. of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated 

20 DNA56423. 

In light of an observed sequence homology between DNA56423 and an EST sequence contained within 
Incyte EST clone no. 1711247, Incyte EST clone no. 171 1247 was obtained and its insert sequenced. It was 
found that the insert encoded a full-length protein The sequence, designated herein as DNA59609-1470, which 
is shown in Figure 205, is the full-length DNA sequence for PROl 106. Clone DNA59609-1470 was deposited 

25 with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 209963. 

The entire nucleotide sequence of DNA59609-1470 is shown in Figure 205 (SEQ ID NO:288). Clone 
DNA59609-1470 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 61-63 and ending at the stop codon at nucleotide positions 1468- 1470 of SEQ ID NO:288 (Figure 205). 
The predicted polypeptide precursor is 469 amino acids long (Figure 206). The full-length PROl 106 protein 

30 shown in Figure 206 has an estimated molecular weight of about 52,689 daltons and a pi of about 8.68. It is 
understood that the skilled artisan can construct the polypeptide or nucleic acid encoding therefor to exclude any 
one or more of all of these domains. For example, the transmembrane domain region(s) and/or either of the 
amino terminal or carboxyl end can be excluded. Clone DNA59609-1470 has been deposited with ATCC on 
June 9. 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that the 

35 sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO 11 06 polypeptide suggests that it possesses 
significant sequence similarity to the peroxisomal ca-dependent solute carrier, thereby indicating that PROl 106 
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may be a novel transporter. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 
35) evidenced sequence identity between the PROl 106 amino acid sequence and at least the following Dayhoff 
sequences, AF004161J, IG002N0125, GDC^BOVIN and BT1_MAIZE. 

EXAMPLE 92 : Isolation of cDNA clones Encodine Human PRO 1291 
5 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 120480. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (Lifeseq* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

10 ei al . , Methods in Enzvmology 266:460480 ( 1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56425. 

In light of an observed sequence homology between the DNA56425 sequence and an EST sequence 

15 encompassed within the Incyte EST clone no. 2798803, the Incyte EST clone 2798803 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 207 and is herein designated as DNA59610-1556. 

Clone DNA59610-1556 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 61-63 and ending at the stop codon at nucleotide positions 907-909 (Figure 207). The 

20 predicted polypeptide precursor is 282 amino acids long (Figure 208). The full-length PR01291 protein shown 
in Figure 208 has an estimated molecular weight of about 30,878 daltons and a pi of about 5.27. Analysis of 
the full-length PRO 1291 sequence shown in Figure 208 (SEQ ID NO:291) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 28, a transmembrane domain from about 
amino acid 258 to about amino acid 281 and potential N-glycosylation sites from about amino acid 1 12 to about 

25 amino acid 1 15 , from about amino acid 1 60 to about amino acid 163, from about amino acid 190 to about amino 
acid 193, from about amino acid 196 to about amino acid 199, from about amino acid 205 to about amino acid 
208, from about amino acid 216 to about amino acid 219 and from about amino acid 220 to about amino acid 
223.. Clone DNA59610-1556 has been deposited with ATCC on June 16, 1998 and is assigned ATCC deposit 
no. 209990. 

30 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 208 (SEQ ID NO:291), evidenced significant 
homology between the PR01291 amino acid sequence and the following Dayhoff sequences: HSU90552_I, 
HSU90144J, AF033107J, HSB73J, HSU90142 1, GGCD80J, P W34452, MOG_MOUSE, B39371 and 
PR71360. 

35 
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EXAMPLE 93 : Isolation of cDNA clones Encoding Hu man PRO 1105 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (Lifeseq*, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Method? jn 
Enzvmology 266:460480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program *phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56430. 

10 In light of an observed sequence homology between the DNA56430 sequence and an EST sequence 

encompassed within the Incyte EST clone no. 1853047, the Incyte EST clone 1853047 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 209 and is herein designated as DNA596 12-1466. 

The entire nucleotide sequence of DNA59612-1466 is shown in Figure 209 (SEQ ID NO:292). Clone 

15 DNA59612-1466 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 28-30 and ending at the stop codon at nucleotide positions 568-570 of SEQ ID NO: 292 (Figure 209). 
The predicted polypeptide precursor is 180 amino acids long (Figure 210). The full-length PROl 105 protein 
shown in Figure 210 has an estimated molecular weight of about 20,040 daltons and a pi of about 8.35. Clone 
DNA59612-1466 has been deposited with the ATCC on June 9, 1998. It is understood that the deposited clone 

20 has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 210, a signal peptide is at about amino acids 1-19 of SEQ ID NO:293 and 
transmembrane domains are shown at about amino acids 80-99 and 145-162 of SEQ ID NO:293. It is 
understood that the skilled artisan could form a polypeptide with all of or any combination or individual selection 
25 of these regions. It is also understood that the corresponding nucleic acids can be routinely identified and 
prepared based on the information provided herein. 

EXAMPLE 94 : Isolation of cDNA clones Encoding Human PRQ51 1 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

30 cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Ahshu! et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

35 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56434. 
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In light of an observed sequence homology between the DNA56434 sequence and an EST sequence 
encompassed within the Incyte EST done no. 1227491, the Incyte EST clone 1227491 was purchased and die 
cDNA insen was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 21 1 and is herein designated as DNA59613-1417. 

The entire nucleotide sequence of DNA59613-1417 is shown in Figure 211 (SEQ ID NO:294). Clone 

5 DNA59613-1417 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 233-235 and ending at the stop codon at nucleotide positions 944-946 (Figure 211). The predicted 
polypeptide precursor is 237 amino acids long (Figure 212). The full-length PR051 1 protein shown in Figure 
212 has an estimated molecular weight of about 25,284 daltons and a pi of about 5.74. Clone DNA59613-1417 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 

10 the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO:295, the putative signal peptide is at about amino 
acids 1-25 of SEQ ID NO:295. The N-glycosylation sites are at about amino acids 4548, 73-76, 107-1 10, 1 18- 
121. 132-135, 172-175, 175-178 and 185-188 of SEQ ID NO:295. An arthropod defensins conserved region 
is at about amino acids 176-182 of SEQ ID NO:295. A kringle domain begins at about amino acid 128 of SEQ 

15 ID NO:295 and a ly-6/u-PAR domain begins at about amino acid 6 of SEQ IDNO:295. The corresponding 
nucleotides of these amino acid sequences and others can be routinely determined given the sequences provided 
herein. 

The designations appearing in a Dayhoff database with which PROS 1 1 has some sequence identity are 
as follows: SSC20F10J; SF04I083; P_W26579; S44208; JC2394; PSTAJMCDI; A27020; S59310; 
20 RAG1RABIT; and MUSBALBC1J. 

EXAMPLE 95 : Isolation of cDNA clones Encoding Human PROl 104 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 

25 expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (Lifeseq* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 

30 the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56446. 

In light of an observed sequence homology between the DNA56446 sequence and an EST sequence 
encompassed within the Incyte EST clone no. 2837496, the Incyte EST clone 2837496 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 

35 sequence of this cDNA insert is shown in Figure 213 and is herein designated as DNA59616-1465. 

The entire nucleotide sequence of DNA59616-1465 is shown in Figure 213 (SEQ ID NO:296). Clone 
DNA59616-1465 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
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positions 10°-! 11 and ending a. the s,op codon a: nucleotide positions 1 132-1 134 of SEQ ID NO:296 (Figure 
213). The predated polypeptide precursor is 341 amino acids long (Figure 2.4). The full-leng* PRO1104 
protein shown in Figure 214 has an estimated molecular weight of about 36.769 daltoas and a pi of about 9.03. 
Clone DNA59616-1465 has been deposited with ATCC on June 16, 1998. 1. is understood that the deposited 
clone has the acmal nucleic acid sequence and that the sequences provided herein are based on known 

5 sequencing techniques. 

Analyzing Figure 214, a signal peptide is at about amino acids 1-22 of SEQ ID NO:297. N- 
rnyristoylation sites are at about amino acids 41-46, 110-115. 133-138, 167-172 and 179-184 of SEQ ID 
NO:297. 

10 PV AMPLE 96 : Isolation of cDNA clones Encoding Human PROl 100 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (Ufeseq* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

15 homology search was performed using the computer program BLAST or BLAST2 (Altshul et ah, Methods in 
Enzvmology 266:46(M80 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). 

In light of an observed sequence homology between the obtained consensus sequence and an EST 

20 sequence encompassed within the Incyte EST clone no. 2305379, the Incyte EST clone 2305379 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 215 and is herein designated as DNA59619-1464. 

The entire nucleotide sequence of DNA59619-1464 is shown in Figure 215 (SEQ ID NO:298). Clone 
DNA59619-1464 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 

25 positions 33-35 and ending at the stop codon at nucleotide positions 993-995 of SEQ ID NO:298 (Figure 215). 
The predicted polypeptide precursor is 320 amino acids long (Figure 216). The full-length PRO1100 protein 
shown in Figure 21 6 has an estimated molecular weight of about 36,475 daltons and a pi of about 7.29. Clone 
DNA59619-1464 has been deposited with ATCC on July 1, 1998. It is understood that the deposited clone has 
the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 

30 techniques. 

Upon analyzing SEQ ID NO:299, the approximate locations of the signal peptide, the transmembrane 
domains, an N-glycosylation site, an N-myristoylation site, a CUB domain and an amiloride-sensitive sodium 
channel domain are present. It is believed that PROl 100 may function as a channel. The corresponding nucleic 
acids for these amino acids and others can be routinely determined given SEQ ID NO:299.. 

35 
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FY AMPLE 97 : Untatinn of cD* > * rlnnes Encorlinp Human PRQ836 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was men compared to a vanery of 
expressed sequence tag (EST) databases which included public EST databases (e.g. . GenBank) and a propnetary 
EST DNA database (Lifeseq* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Method 
En2aS cMy.266:46M80 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did no. encode known proteins were clustered and assembled into a consensus DNA sequence wtth 
me program "phrap" (Phil Green, University of Washington, Seattle. Washington). The consensus sequence 
obtained is herein designated DNA56453. 
10 In light of an observed sequence homology between me DNA56453 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 2610075. me Incyte EST clone 2610075 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length proton. 
The sequence of this cDNA insert is shown in Figure 217 and is herein designated as DNA59620-1463. 

The entire nucleotide sequence of DNA59620-1463 is shown in Figure 217 (SEQ ID NO:300). Clone 
15 DNA59620-1463 contains a single open reading frame with an apparent translatiohal initiation site at nucleotide 
positions 65-67 and ending at the stop codon at nucleotide positions 1448-1450 of SEQ ID NO:300 (Figure 217). 
The predicted polypeptide precursor is 461 amino acids long (Figure 218). The full-length PROS36 protein 
shown in Figure 218 has an estimated molecular weight of about 52.085 daltons and a pi of about 5.36. Analysis 
of the full-length PR0836 sequence shown in Figure 218 (SEQ ID NO:301) evidences the presence of the 
20 following: a signal peptide, N-glycosylation sites, N-mynstoylation sites, a domain conserved in the 
YJL126w/YLR351c/yhcX family of proteins, and a region having sequence identity with SLS1. Clone 
DNA59620-1463 has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone 
has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

25 Analysis of the amino acid sequence of the full-length PR0836 polypeptide suggests thai it possesses 

some sequence similarity to SLS1 , thereby indicating that PR0836 may be involved in protein translocation of 
the ER. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced some 
homology between the PR0836 amino acid sequence and at least the following Dayhoff sequences, S58132, 
SPBC3B9 1, S66714, CRU40057 1 and IMACAEEL. 

30 

EXAMPLE 98 : Isolation of cDNA clones E ncoding Human PRO 1141 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 11873. This EST cluster sequence was then compared 
to a vanety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
35 and a proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
ct al., M^hrvU in Fnzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
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m some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program -phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56518. 

In light of an observed sequence homology between the DNA56518 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2679995, the Incyte EST clone 2679995 was purchased 
5 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 219 and is herein designated as DNA59625-1498. 

Clone DNA59625-1498 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 204-206 and ending at the stop codon at nucleotide positions 945-947 (Figure 219). 
The predicted polypeptide precursor is 247 amino acids long (Figure 220). The fulMength PROl 141 protein 
10 shown in Figure 220 has an estimated molecular weight of about 26,840 daltons and a pi of about 8.19. Analysis 
of the full-length PROl 141 sequence shown in Figure 220 (SEQ ID NO:303) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 19 and transmembrane domains from 
about amino acid 38 to about amino acid 57, from about ammo acid 67 to about amino acid 83. from about 
amino acid 117 to about amino acid 139 and from about amino acid 153 to about amino acid 170. Clone 
15 DNA59625-1498 has been deposited with ATCC on June 16. 1998 and is assigned ATCC deposit no. 209992. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the fulMength sequence shown in Figure 220 (SEQ ID NO:303), evidenced significant 
homology between the PROl 141 amino acid sequence and the following Dayhoff sequences: CEVF36H2L_2, 
PCRB7PRJJ. AB000506J. LEU95008J, MRU87980J5. YIGM_ECOLI, STU65700J, GHU62778J, 
20 CYST_SYNY3 and AF009567J . 

EXAMPLE 99 : Isolation of cD N A clones Encoding Human PROl 132 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein as DNA35934. Based on the DNA35934 
25 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PROl 132. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer : 5-TCCTGTGACCACCCCTCTAACACC-3' (SEQ ID NO:310) and 
30 reverse PCR primer : 5 ' -CTGG A AC ATCTGCTGCCC AGATTC -3 ' (SEQ ID NO:31 1), 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
sequence which had the following nucleotide sequence: 

5 -GTCGGATGACAGCAGCAGCCGCATCATCAATGGATCCGACTGCGATATGC-3' (SEQ ID NO:312). 
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
35 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 132 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney. 
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DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROl 132 and the derived protein sequence for PROl 132. 

The entire nucleotide sequence of PROl 132 is shown in Figure 225 (SEQ ID NO:308). Clone 
DNA59767-1489 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 354-356 and a stop codon at nucleotide positions 1233-1235 (Figure 225; SEQ ID NO:308). The 
5 predicted polypeptide precursor is 293 amino acids long. The signal peptide is at about amino acids 1-22 and 
the histidine active site is at about amino acids 104-109 of SEQ ID NO:309. Clone DNA59767-1489 has been 
deposited with ATCC (having the actual sequence rather than representations based on sequencing techniques 
as presented herein) and is assigned ATCC deposit no. 203108. The full-length PROl 132 protein shown in 
Figure 226 has an estimated molecular weight of about 32,020 daltons and a pi of about 8.7. 
10 An analysis of the DayhofT database (version 35.45 SwissProt 35), using a WTJ-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 226 (SEQ ID NO:309) t revealed sequence identity 
between the PROl 132 amino acid sequence and the following Dayhoff sequences: SSU76256J, P_W10694, 
MMAE000663_6, AF013988J, U6606IJ, MMAE000665_2, MMAE00066415, MMAE00066414, 
MMAE000665_4 and MMAE00066412. 

15 

F.X AMPLE 100 : Isolation of cD NA clones Encoding Human NL7 fPRQ134$) 

A single EST sequence (#1398422) was found in the LIFESEQ" database as described in Example 1 
above. This EST sequence was renamed as DNA45668. Based on the DNA45668 sequence, oligonucleotides 
were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use 
20 as probes to isolate a clone of the full-length coding sequence for NL7. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer : 5'-CACACGTCCAACCTCAATGGGCAG-3' (SEQ ID NO:315) 
reverse PCR primer : 5 * -GACCAGC AGGGCC AAGG AC A AGG -3 ' (SEQ ID NO:316) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
25 DNA45668 sequence which had the following nucleotide sequence: 
hybridization probe: 

5'-GTTCTCTGAGATGAAGATCCGGCCGGTCCGGGAGTACCGCTTAG-3' 
(SEQIDNO:317) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
30 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the NL7 gene using the probe oligonucleotide and one of the PCR primers. RNA for 
construction of the cDNA libraries was isolated from a human fetal kidney library (L1B227). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for NL7 
(designated herein as DNA59776-1600 [Figure 227, SEQ ID NO:3 13]) and the derived protein sequence for NL7 
35 (PR01346). 

The entire coding sequence of NL7 (PR01346) is shown in Figure 227 (SEQ ID NO:313). Clone 
DNA59776-1600 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
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positions 1-3 and an apparent stop codon at nucleotide positions 1384-1386. The predicted polypeptide precursor 
is 46 1 amino acids long. The protein contains an apparent type II transmembrane domain at amino acid positions 
from about 31 to about 50; fibrinogen beta and gamma chains C-terminal domain signature starting at about 
amino acid position 409, and a leucine zipper pattern starting at about amino acid positions 140, 147, 154 and 
161, respectively. Clone DNA59776-1600 has been deposited with ATCC and is assigned ATCC deposit no. 
5 203128. The full-length NL7 protein shown in Figure 228 has an estimated molecular weight of about 50,744 
daltons and a pi of about 6.38. 

Based on a WU-BLAST2 sequence alignment analysis (using the WU-BLAST2 computer program) of 
the full-length sequence, NL7 shows significant amino acid sequence identity to a human micro fibril -associated 
glycoprotein (1 MFA4_HUMAN); to known TIE-2 ligands and ligand homologues, ficolin, serum lectin and 
10 TGF-1 binding protein. 

EXAMPLE 101: Isolation of cDNA clones Encoding Human PROl 131 

A cDNA sequence isolated in the amylase screen described in Example 2 above is herein designated 
DNA43546 (see Figure 231; SEQ ID NO:320). The DNA43546 sequence was then compared to a variety of 

1 5 expressed sequence tag (EST) databases which included public EST databases (e . g GenBank) and a proprietary 
EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Aho, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et ah, Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with 

20 the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA45627. 

Based on the DNA45627 sequence, oligonucleotide probes were generated and used to screen a human 
library prepared as described in paragraph 1 of Example 2 above. The cloning vector was pRK5B (pRK5B is 
a precursor of pRK5D that does not contain the Sfil site; see, Holmes et aL, Science 253:1278-1280 (1991)), 

25 and the cDNA size cut was less than 2800 bp. 

PCR primers (forward and 2 reverse) were synthesized: 
forward PCR primer 5'-ATGCAGGCCAAGTACAGCAGCAC-3' (SEQ ID NO:321); 
reverse PCR primer 1 5 '-CATGCTGACGACTTCCTGCAAGC-3 ' (SEQ ID NO: 322); and 
reverse PCR primer 1 5'-CCACACAGTCTCTGCTTCTTGGG-3' (SEQ ID NO:323) 

30 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA45627 

sequence which had the following nucleotide sequence: 
hybridization probe 

5 '-ATGCTGGATG ATG ATGGGG ACACC ACCATGAGCCTGC ATT-3 ' (SEQ ID NO:324). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
35 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR01131 gene using the probe oligonucleotide and one of the PCR primers. 
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A full length clone was identified that contained a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 144-146, and a stop signal at nucleotide positions 984-986 
(Figure 229; SEQ ID NO:318). The predicted polypeptide precursor is 280 amino acids long, has a calculated 
molecular weight of approximately 31,966 daltons and an estimated pi of approximately 6.26. The 
transmembrane domain sequence is at about 49-74 of SEQ ID N0:319 and the region having sequence identity 

5 with LDL receptors is about 50-265 of SEQ ID NO:319. PROl 131 contains potential N-linked glycosylate 
sites at amino acid positions 95-98 and 169-172 of SEQ ID NO:319. Clone DNA59777-1480 has been deposited 
with the ATCC and is assigned ATCC deposit no. 203 111. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in.Figure 230 (SEQ ID NO:319), evidenced some sequence 

10 identity between the PR01131 amino acid sequence and the following Dayhoff sequences: AB0i0710_l, 149053, 
149115, RNU56863J, LY4A_MOUSE, 155686, MMU56404J, 149361, AF030313J and MMU09739J. 



FX AMPLE 102 : Isolation of cD NA clones Encoding Human PRQ1281 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
15 in Example 1 above. This consensus sequence is designated herein as DNA35720. Based on the DNA35720 

sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 

of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO 1281 . 
PCR primers (forward and reverse) were synthesized: 

forward PCR primers : 
20 5'-TGGAAGGCTGCCGCAACGACAATC-3 ' (SEQ ID NO:327); 

5'-CTGATGTGGCCGATGTTCTG-3' (SEQ ID NO:328); and 

5 ' - ATGGCTC AGTGTGC A G AC AG - 3 ' (SEQ ID NO:329). 

reverse PCR primers : 

5'-GCATGCTGCTCCGTGAAGTAGTCC-3' (SEQ ID NO:330); and 
25 5 ' - ATGC ATGGG AAAG AAGGCCTGCCC-3 ' (SEQ ID NO:33i). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA35720 sequence 
which had the following nucleotide sequence: 
hybridization probe : 

5 ' -TGC ACTGGTG ACC ACG AGGGGGTGC ACTAT AGCC ATCTGG AGCTGAG-3 ' (SEQ ID NO:332). 
30 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used 

to isolate clones encoding the PRO 1281 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated human fetal liver. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
35 PR01281 (designated herein as DNA59820-1549 [Figure 232, SEQ ID NO:325]; and the derived protein 

sequence for PRO 1281. 
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The entire coding sequence of PR01281 is shown in Figure 232 (SEQ ID N0.325). Cone DNA59820- 
,549 contatns a single open reading frame with an apparent transanal initiation site at nucleotide posutons 
228 -230 and an apparent stop codon at nucieotide positions 2553-2555. The predicted polypeptide precursor 
■ n 775 amino acids iong. The fuH-length PR01281 protein shown tn Figure 233 has an esUn^ted molecular 
we lg h, of about 85.48! dal.ons and a pi of about 6.92. Additional features include a stgna. pep— « — ' 
5 amino acids MS; and potentia. N-glycosy.ation sites a. about amino ac.ds 138-141 and 361-364 

An analysis of *e Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
..gnment analysU of the full-length sequence shown in Figure 233 ,SEQ ,D NO:326). revealed som^ecuencc 
iuenutybetweenfcePRO^lanunoacid*^^^ 

CEC38H2.3, CAC2_HAECO. B3A2HUMAN, S22373. CEF38A3_2, CEC34F6_2, CEC34F6_3. and 
10 CELT22 c!l"e DNA59820-1549 has been deposited with ATCC and is assigned ATCC deposit no: 203129. 

FY AMPLE 103 : '-" ft" Fncodin * Huma " PR ° 1064 

A cDN A sequence isolated in the amylase screen described in Example 2 above was found, by the WU- 
15 BLAST2 sequence alignment computer program, to have no significant sequence Mennty to any Known human 
prot ein This cDNA sequence is herein designated DNA45288. The DNA45288 sequence was men compared 
to various EST databases including public EST databases (e.g.. GenBank), and a proprietary EST database 
(LIFESEQ* incyte Pharmaceutical. Palo Alto. CA) to identify homologous EST sequences. The companson 
was performed using the computer program BLAST or BLAST2 |AltschuI et al.. Mejjo^ur^y* 
20 266-46CM80 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in some cases. 90) or greater 
did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle. Washington). This consensus sequence ,s 
herein designated DNA48609. Oligonucleotide primers based upon the DNA48609 sequence were men 
synthesized and employed to screen a human fetal kidney cDN A library which resulted in the identified of 
25 the DNA59827-1426 clone shown in Figure 234. The cloning vector was pRK5B (pRK5B is a precursor of 
pRK5D that does no. contain the Sfil site; see. Holmes et al.. Stins. 251:1278-1280 (1991)). and the cDNA 

size cut was less than 2800 bp. 

The oligonucleotide probes employed were as follows: 
f^M PTE nrimer 5-CTCAGACCCTGCACCACCATCTG-3' (SEQ ID NO:336) 
30 ™™PCR orimer 5'-GGTGCTTCTTGAGCCCCACTTAGC-3' (SEQ ID NO:337) 

Addmonally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA48609 sequence which had the following nucleotide sequence 
hybridation probe 

5-AATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCTGT-3' (SEQ ID NO:338) 
35 A full length clone was identified that contained a single open reading frame with an apparent 

transitional inniatton site a, nucleotide positions 532-534 and a stop signal a. nucleotide positions 991-993 
(Figure 234, SEQ ID NO:333). The predicted polypepude precursor is 153 amino acids long, has a calculated 
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molecular *eigh t of approximately 17,317 daltons and an estimated pi of approximately 5.17. Analysis of the 
fulMeng* PRO1064 sequence shown in Ftgure 235 (SEQ ID NO:334) evidences the presence of the followtng. 
a signal peptide from about annuo acid 1 to about amino acid 24. a transmembrane domain from about ammo 
acid 89 to about amino acid 1 10. an indole-3-glycerol phosphate synthase homology block from about ammo acd 
74 ,o about amino acid 105 and a Myb DN A binding domain protein repeat protein homology block from about 
5 amino acid 1 14 to about amino acid 137. Clone DNA59827- 1426 has beeti deposited with ATCC on August 4, 
1998 and is assigned ATCC deposit no. 203089. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST2 sequence 
alignment analysts of the full-lenph sequence shown in Figure 235 (SEQ ID NO-.334). evidenced homology 
between the PRO1064 amino acid sequence and the following Dayhoff sequences: MMNP15PR0.1. 
10 BP187PLYH.1. CELF42G8_4.MMU58888_1 ,GEN14270.TUB8_SOLTU.RCN_MOUSE.HUMRBSY79J. 
SESENODAJ and A21467J. 

FY AMPLE 104 : i^iatinn of cDNA Hones Encoding Human PRCM379 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
15 in Example 1 above. This consensus sequence is designated herein DNA45232. Based on the DNA45232 
consensus sequence, ohgonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01379. 

PCR primers (forward and reverse) were synthesized: 
20 fn™rH PCR primer 5 ' -TGG AC ACCGT ACCCTGGTATCTGC-3 (SEQ ID NO:34l) 
r^ne PCR primer 5 ' -CC AACTCTG AGG AGAGC AAGTGGC-3 (SEQ ID NO:342) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA45232 sequence which had the following nucleotide sequence: 
h ybridization probe 

25 5MGTATGTGCACACCCTCACCATCACCTCCAAGGGCAAGGAGAAC3' (SEQ ID NO:343). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR01379 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated human fetal kidney tissue. 
30 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR01379 which is designated herein as DNA59828-1608 and shown in Figure 237 (SEQ ID NO:339); and the 
derived protein sequence for PR01379 (SEQ ID NO:340). 

The entire coding sequence of PR01379 is shown in Figure 237 (SEQ ID NO:339). Clone DN A59828- 
1608 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
35 10-12 and an apparent stop codon at nucleotide positions 1732-1734. The predicted polypeptide precursor is 574 
amino acids long. The full-length PR01379 protein shown in Figure 238 has an estimated molecular weight of 
about 65,355 daltons and a pi of about 8.73. Additional features include a signal pep«°e at about amino acids 
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, ,7 and potentia. N-g.ycosyU.ion sites a, abou, amino acids 160-163, 287-290. and 323-326. 

2 ana.ys.s oL Dayhoff database ^ion 35.45 SwissPro, 35, usmg a WU-BLAST2 sequence 

,gnmen,an,ysisof*e^^ 

HP714394 1, and HIV18U45630J. 

Done DNA59828-.608 has been deposited with ATCC and is assigned ATCC deposit no. 203.58. 

. . Mr , F r^.in, of cDNA flmr. FnTortmr Human PR0844 
" pressed sequence tag (EST) DNA database (LIFESEQ™, .ncyte Pharmaceuucals. Palo Ate. CA) 

was searched and an EST was identified wh,ch showed sequence ident.ty with aLP. Based on the informauon 

was further examined. 

DNA sequencing of the insert for this clone gave a sequence (herein designated as DNA59838-.462, 
SEQ ID N0.344) which indudes the ful.-.ength DNA sequence for PR0844 and the derived protein sequence 

fnr PRO 844 

r,eent,renuc,eotid e seq U enceofDNA59838-1462Ushown I „Fig U re23 9 (SEQ.DNO:344). Cone 
DNA59838-1462 contains a single open reading frame with an apparent translationa. initiation site a, nuc.eot.de 
^.ions 5-7 and ending at the stop codon at nucleotide positions 338-340 of SEQ ID NO:344 (Figure 239). 
L predicted peptide precursor is ... amino acids ,ong (F.gure 240). The fu.,,en g * PR0844 protem 
shown in Figure 240 has an estimated molecular weigh, of abou, .2,050 daltons and a pl of abou, 5.45. Cone 
U NQ544DNA59838-.462hasbeendepos.,edwithATCConJune.6,1998. ,, is understood mat me depost.ed 
done has the actua. nuc.eic acid sequence and that the sequences provided herein are based on known 

sequencing techniques. , 

Anatysis of me amino acid sequence of Ae full-.ength PR0844 polypeptide suggests that « possesses 
significant sequence simi.ari,y .0 serine protease inhibuors. thereby inditing ma, PRD844 may be a nove, 
proteinase inhibitor. More specifically, an analysis of the Dayhoff debase (version 35.45 Sw.ssPro« 35 
evidenced significant homology between me PR0844 amino acid sequence and a, leas, the foHowing Dayhoff 
sequences. ALK1_HUMAN, P.P82403. P.P82402, ELAF_HUMAN and P.P60950. 

FX AMPLE 106 : Unlaiinn of cDN A Hones Encoding Human PR084? 

Use of me signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared ,0 a vanery of 
expressed sequence u.g (EST) databases which included public EST da,abases (e.g.. GenBank) and a propnerary 
EST DNA database (LIFESEQ*. Incyte Pharmaceuticals. Palo Alto. CA) to identify existing homolog.es. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul e. a... Mejhodsm 
35 Ejgyjnolasv 266460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater tha, did no, encode known proteins were clustered and assembled in,o a consensus DNA sequence with 
me program "phrap" (Phil Green. Univers.ty of Washington. Seattle. Washington). The consensus sequence 
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obtained therefrom is herein designated DNA55999. 

In light of an observed sequence homology between the DNA55999 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2768571 . the Incyte EST clone 2768571 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 241 and is herein designated as DNA59839-1461. 

5 The entire nucleotide sequence of DNA59839-1461 is shown in Figure 241 (SEQ ID NO:346). Clone 

DNA59839-1461 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 146-148 and ending at the stop codon at nucleotide positions 1946-1948 of SEQ ID NO:346 (Figure 
241). The predicted polypeptide precursor is 600 amino acids long (Figure 242). The full-length PR0848 
proicmsliOwnmFi^ Clone DNA59839- 1461 

10 hasbeendepositedwithATCConJunel6 t 1998. It is understood that the deposited clone has the actual nucleic 
acid sequence and that the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PR0848 polypeptide suggests that it may be a 
novel sialyltransferase. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced sequence identity between the PR0848 amino acid sequence and at least the following Dayhoff 

15 sequences, P_R78619 (GalNAc-alpha-2, 6-sialyltransferase), CAAG5_CHICK (alpha-n-acerylgalactosamide 
alpha-2, 6-sialytransferase) MSV 14550.1 . CAG6HUMAN and P_R632 17 (human alpha-2, 3-sialylrransferase). 

EXAMPLE 107 : Isolation of cDNA Clones Encoding Human PRO 1097 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

20 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Alfshul el ai., Methods in 
Enzvmology 266:460-480 ( 1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

25 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56006. 

In light of an observed sequence homology between the DNA56O06 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2408105, the Incyte EST clone 2408105 was purchased 

30 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 243 and is herein designated as DNA5984 M460. 

The entire nucleotide sequence of DNA59841-1460 is shown in Figure 243 (SEQ ID NO:348). Clone 
DNA5984 1-1460 contains a single open reading frame with an apparent transiational initiation site at nucleotide 
positions 3-5 and ending at the stop codon at nucleotide positions 276-278 of SEQ ID NO:348 (Figure 243). 

35 The predicted polypeptide precursor is 91 amino acids long (Figure 244). The full-length PRO1097 protein 
shown in Figure 244 has an estimated molecular weight of about 10,542 daltons and a pi of about 10.04. Clone 
DNA5984 1-1460 has been deposited with ATCC on July 1, 1998. h is understood that the deposited clone has 
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^ actua, nucleic acid sequence and that the sequences provided herein axe based on Known sequent 

Mt Lyz* g Hgure 244, the signa. pept.de is at about amino acids 1-20 of SEQ ID NO:349. The 
gly copro.ease family protein domain starts at about amino acid 56, and the acy.transferase ChoActase/COT/CPT 
family peptide starts at about amino acid 49 of SEQ ID NO:349. 

5 

FY AMPLE 1Q8 - 1-- 1 -""" "f rnN A clone5 Enr od '' 1 '' Human PRQ1 1S3 

^7o7*e signal sequence algorhhm described in Example 3 above allowed identification of a stngle 
EST duster sequence from the Incyte database. This EST cluster sequence was then compared to a vanety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
10 EST DNA database (UFESEQ*. Incyte Pharmaceuticals. Pato Alto, CA) to identify existing homolog.es. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., M^M. 
E „ 2 v J r^266:46M80 (1996)). Those comparisons resulting in a BLAST score of 70 (or m some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence wnh 
the program "phrap- (Phil Green. University.of Washington, Seattle, Washington). The consensus sequence 
1 5 obtained therefrom is herein designated DNA56008. 

in light of an observed sequence homology between the DNA56008 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2472409. the Incyte EST Cone 2472409 was purchased 
and me cDNA insert was obtained and sequenced. I. was found that this insert encoded a full-length proem. 
The seauence of this cDNA insert is shown in Figure 245 and is herein designated as DNA59842-1502. 
20 The full length clone shown in Figure 245 contained a single open reading frame with an apparent 

National initiation site at nucleotide positions 92-94 and ending a, the stop codon found at nucleotide positions 
683-685 (Figure 245; SEQ ID NO:350). The predicted polypeptide precursor (Figure 246, SEQ ID NO:351) 
is 197 am.no acids long. PR01153 has a calculated molecular weight of approximately 21,540 daltons and an 
estimated pi of approximately 8.31. Clone DNA59842-1502 has been deposited with ATCC and is assigned 
25 ATCC deposit no. 209982. It is understood that the correct and actual sequence is in the deposited clone while 
herein are present representations based on current sequencing techniques which may have minor errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
length sequence. PR01153 shows some amino acid sequence identity to the following Dayhoff designations: 
S57447; SOYHRGPCJ; S46965; P_P82971; VCPHEROPH.l; EXTN.TOBAC; MLCB2548.9; 
30 ANXA RABIT; JC5437 and SSGP_VOLCA. 

FXAMPLE 109 : Unlaiinn of cPN A clones E ncoding Human PROl 154 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
35 expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ* Incyte Pharmaceuticals. Palo Alto. CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
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En7 vn,olo g y266:46M80(»996)). Those compansons rating in a BLAST score of 70 (or in so™ cases 90) 
or greater that did not encode known proteins were c.ustered and assembled into a consensus DNA sequence «*h 
the program "phrap- (Phil Green. Universtty of Washington. Seattle. Washington). The consensus sequence 
obtained therefrom is herein designated DNA56025 . 

in Ugh. of an observed sequence homology between the DNA56025 consensus sequence and an EST 
5 sequence encompassed within the Incyte EST Cone no. 2169375. the .ncyte EST Cone 2169375 was purchased 
and the cDNA insert was obtained and sequenced. It was found that mis insert encoded a full-length proton. 
The sequence of mis cDNA insert is shown in Figure 247 and is herein designated as DNA59846-1503. 

The full length clone shown in Figure 247 contained a single open reading frame with an apparent 
transanal initiation site a. nucleotide positions 86-88 and ending at the stop codon found a, nucleotide positions 
10 2909-291 1 (Figure 247; SEQ ID NO:352). The predicted polypeptide precursor (Figure 248. SEQ ID NO:353) 
is 941 amino acids long. PROl 154 has a calculated molecular weight of approximately 107. 144 daltons and an 
estimated pi of approximately 6.26. Cone DNA59846-1503 has been deposited with ATCC and is assigned 

ATCC deposit no. 209978. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 

15 leng* sequence, PR01154 shows sequence 
AMPN_HUMAN,IWinfi^ 
and AMPREHUM AN . 

FX A MPLE 110 : Unlatinn of cD NA clones Encodinp Human PROl lgl 
20 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database, designated herein as 82468. This EST cluster sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
GenBank) and a proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 

25 (AHshuletal., in Fnzvmologv 266:46(M80 (1996)). Those comparisons resulting in a BLAST score 

of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA56029. 

In light of an observed sequence homology between the DNA56029 consensus sequence and an EST 

30 sequence encompassed within the Incyte EST clone no. 21 86536, the Incyte EST clone 2186536 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 249 and is herein designated as DNA59847-1511. 

Clone DNA59847-15I1 contains a single open reading frame with an apparent translation^ initiation 
site at nucleotide positions 17-19 and ending at the stop codon at nucleotide positions 1328-1330 (Figure 249). 

35 The prated polypeptide precursor is 437 amino acids long (Figure 250). The full-length PROl 181 protein 
shown in Figure 250 has an estimated molecular weight of about 46,363 daltons and a pi of about 6.22 . Analysis 
of the full-length PROl 181 sequence shown in Figure 250 (SEQ ID NO:355) evidences the presence of the 
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following: a signal peptide from about amino acid 1 to about amino acid 15, potential N-giycosylation sites from 
about amino acid 46 to about amino acid 49, from about amino acid 189 to about amino acid 192 and from about 
amino acid 382 to about amino acid 385 and amino acid sequence blocks having homology to Ly-6/u-PAR 
domain proteins from about amino acid 287 to about amino acid 300 and from about amino acid 98 to about 
amino acid 111. Clone DNA59S47-1511 has been deposited with ATCC on August 4, 1998 and is assigned 
5 ATCC deposit no. 203098. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 250 (SEQ ID NO:355), evidenced homology 
between the PROU81 amino acid sequence and the following Dayhoff sequences: AF041083J, PJV26579, 
RNMAGPIANJ, CELT13C2_2, LMSAP2GNJ, S61882, CEF35C5J2, DP87JMCDI, GIU47631J and 
10 PJW7092. 

EXAMPLE 111 : Isolation of cDNA clones Encoding Human PRQ1182 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database, designated herein as 146647. This EST cluster sequence was 

15 then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 

20 assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56033. 

In light of an observed sequence homology between the DNA56033 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2595195, the Incyie EST clone 2595195 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

25 The sequence of this cDNA insert is shown in Figure 251 and is herein designated as DNA59848-1512. 

Clone DNA59848-1512 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 67-69 and ending at the stop codon at nucleotide positions 880-882 (Figure 251). The 
predicted polypeptide precursor is 27 1 amino acids long (Figure 252). The full-length PRO 1 1 82 protein shown 
in Figure 252 has an estimated molecular weight of about 28,665 daltons and a pi of about 5.33. Analysis of 

30 the full-length PROI182 sequence shown in Figure 252 (SEQ ID NO:357) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 25, an amino acid block having 
homology to C-rype lectin domain proteins from about amino acid 247 to about amino acid 256 and an amioo 
acid sequence block having homology to CJq domain proteins from about amino acid 44 to about amino acid 
77. Clone DNA59848-I512 has been deposited with ATCC on August 4, 1998 and is assigned ATCC deposit 

35 no. 203088. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 252 (SEQ ID NO: 357), evidenced significant 
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homology between the PR01182 amino acid sequence and the following Dayhoff sequences: PSPDBOVIN, 
CL43_BOVIN, CONG_BOVIN, PW18780, P_R45005, PR53257 and CELEGAP7_1. 

EXAMPLE 1 12 : Isolation of cDNA clones Encoding Human PRO 11 55 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the lncyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ* lncyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56102. 

In light of an observed sequence homology between the DNA56102 consensus sequence and an EST 
sequence encompassed within the lncyte EST clone no. 2858870. the lncyte EST clone 2858870 was purchased 
and the cDN A insert was obtained and sequenced. It was found that this insert -encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 253 and is herein designated as DNA59849-1504. 

The full length clone shown in Figure 253 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 158-160 and ending at the stop codon found at nucleotide 
positions 563-565 (Figure 253; SEQ ID NO:358). The predicted polypeptide precursor (Figure 254, SEQ ID 
NO:359) is 1 35 amino acids long. PRO 1 155 has a calculated molecular weight of approximately 14,833 daltons 
and an estimated pi of approximately 9.78. Clone DNA59849-1504 has been deposited with ATCC and is 
assigned ATCC deposit no. 209986. It is understood that the actual clone has the correct sequence whereas 
herein are only representations which are prone to minor sequencing errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
length sequence, PROl 155 shows some amino acid sequence identity with the following Dayhoff designations: 
TKNK_BOVIN; PVB19X587J; AF019049J; PW00948; S72864; P_W00949; 162742; AF038501J; 
TKNG HUMAN; and YAT1RHOBL. Based on the information provided herein, PROl 155 may play a role 
in providing neuroprotection and cognitive enhancement. 

EXAMPLE 113: Isolation of cDNA clones Encoding Human PRO 11 56 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the lncyte database, designated herein as 138851. This EST cluster sequence was 
then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ* lncyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al. t Methods in Enzvmologv 266:46fui«n (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
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assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56261. 

In light of an observed sequence homology between the DNA56261 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3675191, the Incyte EST clone 3675191 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
5 The sequence of this cDNA insert is shown in Figure 255 and is herein designated as DNA59853-1505. 

The full length clone shown in Figure 255 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 212-214 and ending at the stop codon found at nucleotide 
positions 689-691 (Figure 255; SEQ ID NO:360). The predicted polypeptide precursor (Figure 256. SEQ ID 
NO:361)is 159 amino acids long. PRO 1 156 has a calculated molecular weight of approximately 17,476 daltons, 
10 an estimated pi of approximately 9.15, a signal peptide sequence at about amino acids 1 to about 22, and 
potential N-glycosylation sites at about amino acids 27-30 and 41-44. 

Clone DNA59853-1505 was deposited with the ATCC on June 16, 1998 and is assigned ATCC deposit 
no. 209985. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
1 5 alignment analysis (using the ALIGN computer program) of the full-length sequence shown in Figure 256 (SEQ 
ID NO:361), revealed some homology between the PROl 156 amino acid sequence and the following Dayhoff 
sequences: D45027 1, PR799I4, JC5309, KBF2HUMAN, AF010144J, GEN14351, S68681, P_R79915, 
ZMTAC 3, and HUMCPGO 1. 



20 EXAMPLE 114: Isolation of c DNA Clones Encoding Human PRQ1Q98 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
25 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
EnzymoloRY 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56377. 
30 In light of an observed sequence homology between the DNA56377 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3050917, the Incyte EST clone 3050917 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 257 and is herein designated as DNA59854-1459. 

The entire nucleotide sequence of DNA59854-1459 is shown in Figure 257 (SEQ ID NO:362). Clone 
35 DNA59854-1459 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 58-60 and ending at the stop codon at nucleotide positions 292-294 of SEQ ID NO:362 (Figure 257). 
The predicted polypeptide precursor is 78 amino acids long (Figure 258). The full-length PRO1098 protein 
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shown in Figure 258 has an estimated molecular weight of about 8,3% daltons and a pi of about 7.66. Clone 
DNA59854-1459 has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone 
has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 258, a signal peptide appears to be at about amino acids 1-19 of SEQ ID NO;363, 
5 an N-glycosylation site appears to be at about amino acids 37-40 of SEQ ID NO:363, and N-myristoylation sites 
appear to be at about 15-20, 19-24 and 60-65 of SEQ ID NO:363. 

EXAMPLE 115 : Isolation of cDNA clones Encoding Human FRO 1 127 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

10 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (AltshuJ et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

15 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57959. 

In light of an observed sequence homology between the DNA57959 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 685 126, the Merck EST clone 685126 was purchased 

20 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 259 and is herein designated as DNA60283-1484, 

The full length clone shown in Figure 259 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 126-128 and ending at the stop codon found at nucleotide 
positions 327-329 (Figure 259; SEQ ID NO:364). The predicted polypeptide precursor (Figure 260, SEQ ID 

25 NO:365) is 67 amino acids long including a signal peptide at about 1-29 of SEQ ID NO:365. PRO 1 127 has a 
calculated molecular weight of approximately 7,528 daltons and an estimated pi of approximately 4.95. Clone 
DNA60283-1484 was deposited with the ATCC on July I, 1998 and is assigned ATCC deposit no. 203043. 
It is understood that the deposited clone has the actual sequence, whereas representations which may have minor 
sequencing errors are presented herein. 

30 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 260 (SEQ ID NO:365), revealed some homology 
between the PROl 127 amino acid sequence and the following Dayhoff sequences: AF037218_48, P_W09638, 
HBA_HETPO, S39821, KR2 EBV, CET20D3 8, HCU37630 1, HS193B12J0. S40012 and TRITUBCJ. 

35 
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EXAMPLE 116 : Isolation of cDNA clones Encoding H uman PROl 126 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmoloev 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater thai did not encode known proteins were clustered and assembled inio a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56250. 
10 In light of an observed sequence homology between the DNA56250 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 1437250, the Incyte EST clone 1437250 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 261 and is herein designated as DNA60615-1483. 

Clone DNA60615-1483 contains 'a single open reading frame with an apparent translational initiation 
15 site at nucleotide positions 1 10-1 12 and ending at the stop codon at nucleotide positions 1316-1318 (Figure 261). 
The predicted polypeptide precursor is 402 amino acids long (Figure 262). The full-length PRO 11 26 protein 
shown in Figure 262 has an estimated molecular weight of about 45,92 1 daltons and a pi of about 8.60. Analysis 
of the full-length PROl 126 sequence shown in Figure 262 (SEQ ID NO:367) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 25 and potential N-glycosylation sites 
20 from about amino acid 66 to about amino acid 69, from about amino acid 1 38 to about amino acid 14 1 and from 
about amino acid 183 to about amino acid 186. Clone DNA60615-I483 has been deposited with ATCC on June 
16, 1998 and is assigned ATCC deposit no. 209980. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 262 (SEQ ID NO:367), evidenced significant 
25 homology between the PROl 126 amino acid sequence and the following Dayhoff sequences: 173636, 
NOMRHUMAN, MMUSMYOC3J, HS454G6J, P_R98225, RNU78105J, RNU72487J, AF035301_U 
CEELC48E7 4 and CEFI 1C33. 

EXAMPLE 117 : Isolation of cDNA clones Encoding Human PROl 125 

30 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 

35 Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
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obtained therefrom is herein designated DNA56540. 

In light of an observed sequence homology between the DNA56540 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 14861 14, the Incyte EST clone 14861 14 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 263 and is herein designated as DNA60615-1483. 
5 The full length clone shown in Figure 263 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 47-49 and ending at the stop codon found at nucleotide positions 
1388-1390 (Figure 263; SEQ ID NO:368). The predicted polypeptide precursor (Figure 264, SEQ ID NO:369) 
is 447 amino acids long. PRO 1 125 has a calculated molecular weight of approximately 49,798 daltons and an 
estimated pi of approximately 9.78. Clone DNA60619-1482 has been deposited with ATCC and is assigned 

1 0 ATCC deposit no. 209993. It is understood that the clone has the actual sequence and that the sequences herein 
are representations based on current techniques which may be prone to minor errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
length sequence, PRO 1125 shows some sequence identity with the following Dayhoff designations: 
RC01_NEUCR; S58306; PKWA_THECU; S76086; P_R85881; HETl_PODAN; SPU92792J; 

15 APAF_HUMAN; S76414 and S59317. 

EXAMPLE 118 : Isolation of cDNA clones Encoding Human PRO 11 86 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 

20 expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzymology 266:460-480 ( 1 996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 

25 the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56748. 

In light of an observed sequence homology between the DNA56748 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3476792, the Incyte EST clone 3476792 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

30 The sequence of this cDNA insert is shown in Figure 265 and is herein designated as DNA60621-1516. 

The full length clone shown in Figure 265 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 91-93 and ending at the stop codon found at nucleotide positions 
406-408 (Figure 265; SEQ ID NO:370). The predicted polypeptide precursor (Figure 266, SEQ ID NO:371) 
is 105 amino acids long. The signal peptide is at amino acids 1-19 of SEQ ID NO:371. PR01186 has a 

35 calculated molecular weight of approximately 1 1 ,715 daltons and an estimated pi of approximately 9.05. Clone 
DNA60621-1516 was deposited with the ATCC on August 4, 1998 and is assigned ATCC deposit no. 203091. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 266 (SEQ ID NO:371), revealed some sequence 
identity between the PR01186 amino acid sequence and the following Dayhoff sequences: VPRA_DENPO, 
LFE4CHICK, AF034208J, AF030433J, A55035, COL_RABIT t CELB0507_9, S67826J, S34665 and 
CRU73817J. 

5 

EXAMPLE 119 : Isolation of cDNA clones EncodinE Human PRO 1 198 

An initial DNA sequence referred to herein as DNA52083 was identified using a yeast screen in a 
human umbilical vein endothelial cell cDNA library that preferentially represents the 5' ends of the primary 
cDNA clones. DNA52083 was compared to ESTs from public databases (e.g., GenBank), and a proprietary 

10 EST database (UFESEQ* Incyte Pharmaceuticals, Palo Alto, CA), using the computer program BLAST or 
BLAST2 lAltschul et al.. Methods in Enzvmologv. 266:460-480 (1996)]. The ESTs were clustered and 
assembled into a consensus DNA sequence using the computer program "phrap" (Phil Green, University of 
Washington, Seattle, Washington). One or more of the ESTs was obtained from human breast skin tissue 
biopsy. This consensus sequence is designated herein as DNA52780. 

15 In light of an observed sequence homology between the DNA52780 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3852910, the Incyte EST clone 3852910 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 267 and is herein designated as DNA60622-1525. 

The full length DNA60622-1525 clone shown in Figure 267 (SEQ ID NO:372) contained a single open 

20 reading frame with an apparent translation^ initiation site at nucleotide positions 54 to 56 and ending at the stop 
codon found at nucleotide positions 741 to 743. The predicted polypeptide precursor, which is shown in Figure 
268 (SEQ ID NO:373), is 229 amino acids long. PROl 198 has a calculated molecular weight of approximately 
25,764 daltons and an estimated pi of approximately 9. 17. There is a signal peptide sequence at about amino 
acids 1 through 34. There is sequence identity with glycosyl hydrolases family 3 1 protein at about amino acids 
25 142 to about 175. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WTJ-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 268 (SEQ ID NO:373), revealed some homology 
between the PROl 198 amino acid sequence and the following Dayhoff sequences: ATF6H1 1_6, UCRI_RAT, 
TOBSUP2NTJ, RCUERF3J, AMU88186J, P_W22485, S56579, AF04071IJ, DPP4 PIG. 
30 Clone DNA60622-1525 was been deposited with the ATCC on August 4, 1998, and is assigned ATCC 

deposit no. 203090. 



35 



EXAMPLE 120: Isolation of cDNA clones Encoding Human PROl 158 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ* Incyte Pharmaceuticals. Palo Alto, CA) to identify existing homologies. The 



472 



L 



WO 99/63088 



PCT/US99/12252 



homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57248. 
5 In light of an observed sequence homology between the DNA57248 consensus sequence and an EST 

sequence encompassed within the lncyte EST clone no. 2640776, the Incyte EST clone 2640776 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 269 and is herein designated as DNA60625-I507. 

The full length clone shown in Figure 269 contained a single open reading frame with an apparent 
10 translational initiation site at nucleotide positions 163 to 165 and ending at the stop codon found at nucleotide 
positions 532 to 534 (Figure 269; SEQ ID NO:374). The predicted polypepude precursor (Figure 270, SEQ 
ID NO:375) is 123 amino acids long. PROH58 has a calculated molecular weight of approximately 13,113 
daltons and an estimated pi of approximately 8.53. Additional features include a signal peptide sequence at about 
amino acids 1-19, a transmembrane domain at about amino acids 56-80, and a potential N-glycosylation site at 
15 about amino acids 36-39. Clone DNA60625-1507 was deposited with the At CC on June 16, 1998 and is 
assigned ATCC deposit no. 209975. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 270 (SEQ ID NO:375), revealed some homology 
between the PROl 158 arnino acid sequence and the following Dayhoff sequences: ATAC00310510F18A8.10, 
20 P_R85151, PHS2_SOLTU, RNMHCIBAC1 , RNA1FMHC1, 168771, RNRT1A10GJ, PTPA_HUMAN, 
HUMGACAl, and CHKPTPAJ. 



EXAMPLE 121 : Isolation of cDNA clones Encoding Human PROl 159 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
25 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
30 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57221. 

In light of an observed sequence homology between the DNA57221 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 376776, the Incyte EST clone 376776 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 271 and is herein designated as DNA 60627- 1508. 
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Clone DNA60627-1508 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 92-94 and ending at the stop codon at nucleotide positions 362-364 (Figure 27 1 ). The 
predicted polypeptide precursor is 90 amino acids long (Figure 272). The full-length PROl 159 protein shown 
in Figure 272 has an estimated molecular weight of about 9,840 daltons and a pi of about 10.13. Analysis of 
the full-length PROl 159 sequence shown in Figure 272 (SEQ ID NO:377) evidences the presence of the 
5 following: a signal peptide from about amino acid 1 to about amino acid 15 and a potential N-glycosylation site 
from about amino acid 38 to about amino acid 41. Clone DNA60627-1508 has been deposited with ATCC on 
August 4, 1998 and is assigned ATCC deposit no. 203092. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 272 (SEQ ID NO:377), evidenced significant 
10 homology between the PROl 159 amino acid sequence and the following Dayhoff sequences: AF016494_6, 
AF036708_20, DSSCUTEJ, D89100J, S28060, MEFA_XENLA, AF020798J2.G70065, E64423.JQ2005. 



EXAMPLE 122: Isolation of cDNA clones Encoding Human PROl 124 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
15 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. Tne 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et ah, Methods in 
Enzympiog^266:46(M80 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
20 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56035. 

In light of an observed sequence homology between the DNA56035 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2767646, the Incyte EST clone 2767646 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 273 and is herein designated as DNA60629-I481. 

The full length clone shown in Figure 273 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 25-27 and ending at the stop codon found at nucleotide positions 
2782-2784 (Figure 273; SEQ ID NO:378). The predicted polypeptide precursor (Figure 274, SEQ ID NO:379) 
is 919 ammo acids long. PROl 124 has a calculated molecular weight of approximately 101 ,282 daltons and an 
estimated pi of approximately 5.37. Clone DNA60629-1481 has been deposited with the ATCC and is assigned 
ATCC deposit no. 209979. It. is understood that the deposited clone has the actual sequence, whereas only 
representations based on current sequencing techniques which may include normal and minor errors, are 
provided herein. 

Based on a WU-BLAST2 sequence alignment analysis of the full-length sequence, PROl 124 shows 
significant ammo acid sequence identity to a chloride channel protein and to ECAM-1. Specifically, the 
following Dayhoff designations were identified as having sequence identity with PROl 124: ECLC BOVIN, 
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AF001261J, P_W06548, SSC6A10J, AF004355J, S76691, AF017642, BYU06866_2, CSA_DICDI and 

SAU47139_2. 

EXAMPLE 123 : Isolation of cDNA clones Encoding Human PRO 1287 

An expressed sequence lag (EST) DNA database (L1FESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) 
5 was searched and an EST was identified which showed homology to the fringe protein. This EST sequence was 
then compared to various EST databases including public EST databases (e.g . , GenBank), and a proprietary EST 
database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST sequences. The 
comparison was performed using the computer program BLAST or BLAST2 [Altschul et ah, Methods in 
Enzvmologv. 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in some cases, 

10 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence 
with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). This consensus 
sequence obtained is herein designated DNA40568. 

Based on the DNA40568 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR 
a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full- 

15 length coding sequence for PRO 1287. Forward and reverse PCR primers generally range from 20 to 30 
nucleotides and are often designed to give a PCR product of about 100-1000 bp in length. The probe sequences 
are typically 40-55 bp in length. In some cases, additional oligonucleotides are synthesized when the consensus 
sequence is greater than about l-1.5kbp. In order to screen several libraries for a full-length clone, DNA from 
the libraries was screened by PCR amplification, as per Ausubel et ah, Current Protocols in Molecular Biology. 

20 supra, with the PCR primer pair. A positive library was then used to isolate clones encoding the gene of interest 
using the probe oligonucleotide and one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 
f orward PCR primer 5 '-CTCGGGG A AAG GG ACTTG ATGTTGG-3 ' (SEQ ID NO:382) 
reverse PCR primer 1 5'-GCGAAGGTGAGCCTCTATCTCGTGCC-3' (SEQ ID NO:383) 

25 reverse PCR primer 2 5 ' -C AGCCTAC ACGT ATTG AGG-3 ' (SEQ ID NO: 384) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40568 
sequence which had the following nucleotide sequence 
hybridization probe 

5'-CAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCC-3' (SEQ ID NO:385). 

30 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used 
to isolate clones encoding the PRO 1287 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human bone marrow tissue. The cDNA 
libraries used to isolated the cDNA clones were constructed by standard methods using commercially available 

35 reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT containing a NotI 
site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; 
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pRK5B is a precursor of pRK5D that does not contain the Sfil site; see. Holmes et al., Science , 253:1278-1280 
(1991)) in the unique Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above gave the full -length DNA sequence for 
PR01287 (designated herein as DNA6I755-1554 [Figure 275, SEQ ID NO:380]) and the derived protein 
sequence for PRO 1287. 

The entire nucleotide sequence of DNA61755-1554 is shown in Figure 275 (SEQ ID NO:380). The 
full length clone contained a single open reading frame with an apparent translational initiation site at nucleotide 
positions 655-657 and a stop signal at nucleotide positions 2251-2253 (Figure 275, SEQ ID NO:380). The 
predicted polypeptide precursor is 532 amino acids long, has a calculated molecular weight of approximately 
61 ,351 daltons and an estimated pi of approximately 8.77. Analysis of the full-length PRO 1 287 sequence shown 
in Figure 276 (SEQ ID NO:381) evidences the presence of the following: a signal peptide from about amino acid 
1 to about amino acid 27 and potential N-glycosylation sites from about amino acid 315 to about amino acid 318 
and from about amino acid 324 to about amino acid 327. Clone DNA6 1755- 1554 has been deposited with 
ATCC on August 11, 1998 and is assigned ATCC deposit no. 203112. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 276 (SEQ ID NO:38I), evidenced significant 
homology between the PRO 1287 amino acid sequence and the following Dayhoff sequences: CET24D1_1, 
EZRIBOVIN, GGU19889J, CC3YEAST, S74244, NALS MOUSE, MOES_PIG. S28660, S44860 and 
YNA4_CAEEL. 

EXAMPLE 124 : Isolation of cDNA clones Encoding Human PRQ1312 

DN A55773 was identified in a human fetal kidney cDNA library using a yeast screen that preferentially 
represents the 5* ends of the primary cDNA clones. Based on the DNA55773 sequence, oligonucleotides were 
synthesized for use as probes to isolate a clone of the full-length coding sequence for PR01312. 

The full length DNA61873-1574 clone shown in Figure 277 (SEQ ID N0:386) contained a single open 
reading frame with an apparent translational initiation site at nucleotide positions 7-9 and ending at the stop 
codon found at nucleotide positions 643-645. The predicted polypeptide precursor is 212 amino acids long 
(Figure 278, SEQ ID N0:387). PR01312 has a calculated molecular weight of approximately 24,024 daltons 
and an estimated pi of approximately 6.26. Other features include a signal peptide at about amino acids 1-14; 
a transmembrane domain at about amino acids 141-160, and potential N-glycosylation sites at about amino acids 
76-79 and 93-96. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 278 (SEQ ID NO: 387), revealed some homology 
between the PRO 1312 amino acid sequence and the following Dayhoff sequences: GCINTALPH 1, 
GIBMUCIAJ, P_R96298, AF001406_1, PVU88874J, PR85151, AF041409J, CELC50F2_7, C45875, 
and AB009510_21. 

Clone DNA6 1873- 1574 has been deposited with ATCC and is assigned ATCC deposit no. 203132. 
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FX AMPLE 125 : Isolation of cDNA clones Encoding Human PROl 192 

A consensus DNA sequence was assembled relative io other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein DNA35924. Based on the DNA35924 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
5 PROH92. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer : 5'-CCGAGGCCATCTAGAGGCCAGAGC-3 ' {SEQ ID NO:390) 
reverse PCR primer : 5'-ACAGGCAGAGCCAATGGCCAGAGC-3' (SEQ ID NO:391). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
10 DNA35924 sequence which had the following nucleotide sequence: 
hybridization probe : 

5--GAGAGGACTGCGGGAGTTTGGGACCTTTGTGCAGACGTGCTCATG-3' (SEQ ID NO:392). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
15 isolate clones encoding the PRO 1 192 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal liver and spleen tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROl 192 designated herein as DNA628I4-1521 and shown in Figure 279 (SEQ ID NO:388); and the derived 
protein sequence for PR01192 which is shown in Figure 280 (SEQ ID NO:389). 
20 The entire coding sequence of PROH92 is shown in Figure 279 (SEQ ID NO:388). Clone DNA62814- 

1521 contains a single open reading frame with an apparent iranslational initiation site at nucleotide positions 
121 - 123 and an apparent stop codon at nucleotide positions 766-768. The predicted polypeptide precursor is 215 
amino acids long. The predicted polypeptide precursor has the following features: a signal peptide at about 
amino acids 1-21; a transmembrane domain at about amino acids 153-176; potential N-glycosylation sites at 
25 about amino acids 39-42 and 118-121; and homology with myelin P0 proteins at about amino acids 27-68 and 
99-128 of Figure 280. The full-length PROl 192 protein shown in Figure 280 has an estimated molecular weight 
of about 24,484 daltons and a pi of about 6.98. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 280 (SEQ ID NO:389), revealed homology 
30 between the PRO 11 92 amino acid sequence and the following Dayhoff sequences: GEN12838, MYP0_HUMAN, 
AF049498 1, GEN14531, P_W14146, HS46KDAJ, CINB RAT, OX2G_RAT, D87018J, and D86996_2. 

Clone DNA62814-1521 was deposited with the ATCC on August 4, 1998, and is assigned ATCC 
deposit no. 203093. 

35 
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EXAMPLE 126 : Isolation of cPNA clones E ncoding Human PRO 11 60 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above This consensus sequence is herein designated DNA40650. Based on the DNA40650 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
5 PRO1160. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-GCTCCCTGATC'^^CATGTCACCACC-3^ (SEQ ID NO:395) 
mverse PCR primer 5'-C AGGG ACACACTCTACCATTCGGGAG-3 ' (SEQ ID NO:396) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40650 
10 sequence which had the following nucleotide sequence 
hybridization probe 

5 '-CCATCTTTCTGGTCTCTGCCCAGAATCCG AC AACAGCTGCTC-3 ' (SEQ ID NO:397) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above, A positive library was then used to 
15 isolate clones encoding the PRO 1 160 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated from human breast tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PRO1160 (designated herein as DNA62872-1509 [Figure 281, SEQ ID NO: 393]) and the derived protein 

sequence for PRO 1160. 

20 The entire nucleotide sequence of DNA62872-1509 is shown in Figure 281 (SEQ ID NO:393). Clone 

DNA62872-1509 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 40-42 and ending at the stop codon at nucleotide positions 310-312 (Figure 281). The predicted 
polypeptide precursor is 90 amino acids long (Figure 282). The full-length PROl 160 protein shown in Figure 
282 has an estimated molecular weight of about 9,039 daltons and a pi of about 4.37. Analysis of the full-length 

25 PROl 160 sequence shown in Figure 282 (SEQ ID NO:394) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 19 and a protein kinase C phosphorylation site from about 
amino acid 68 to about amino acid 70. Clone DNA62872-1509 has been deposited with ATCC on August 4, 
1998 and is assigned ATCC deposit no. 203100. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

30 alignment analysis of the full-length sequence shown in Figure 282 (SEQ ID NO:394), evidenced significant 
homology between the PROl 160 amino acid sequence and the following Dayhoff sequences: B3O305, 
GEN13490, 153641, S53363, HA34_BRELC, SP96_DICDI, S36326, SSU5119710, MUC1_XENLA, 
TCU32448J and AF0OO409J. 

35 EXAMPLE 127 : Isolation of cDNA clones Encodine Human PROl 187 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
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expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ*, lncyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzvmologv 266:460-480 ( 1 996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
5 the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57726. 

In light of an observed sequence homology between the DNA57726 consensus sequence and an EST 
sequence encompassed within the lncyte EST clone no. 358563, the lncyte EST clone 358563 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

10 The sequence of this cDNA insert is shown in Figure 283 and is herein designated as DNA62876-1517. 

The full length clone shown in Figure 283 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 121-123 and ending at the stop codon found at nucleotide 
positions 481-483 (Figure 283; SEQ ID NO:398). The predicted polypeptide precursor (Figure 284, SEQ ID 
NO:399) is 120 amino acids long. The signal peptide is at about amino acids 1-17 of SEQ ID NO:399. 

15 PR01I87 has a calculated molecular weight of approximately 12,925 daltons and an estimated pi of 
approximately 9.46. Clone DNA62876-1517 was deposited with the ATCC on August 4, 1998 and is assigned 
ATCC deposit no. 203095. It is understood that the deposited clone contains the actual sequence and that the 
representations herein may have minor sequencing errors. 

,\j analysts of the Dayhoff database (version 35 45 SwissProt 35), using a WU-BLAST2 sequence 

20 alignment analysis of the full-length sequence shown in Figure 284 (SEQ ID NO:399), revealed some sequence 
identity (and therefore some relation) between the PRO 11 87 amino acid sequence and the following Dayhoff 
sequences: MGNENDOBXl, CELF41G3_9, AMPG_STRLI, HSBBOVHERL2, LEEXTEN101, 
AF029958J and P W04957. 

25 EXAMPLE 128 : Isolation of cDNA clones Encoding Human PRO 1 185 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the lncyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ*. lncyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

30 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460^80 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56426. 

35 In light of an observed sequence homology between the DNA56426 consensus sequence and an EST 

sequence encompassed within the lncyte EST clone no. 328441 1, the lncyte EST clone 328441 1 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
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The sequence of this cDNA insert is shown in Figure 285 and is herein designated as DNA62881-1515. 

The full length DNA62881-1515 clone shown in Figure 285 contained a single open reading frame with 
an apparent translation^ initiation site at nucleotide positions 4-6 and ending at the stop codon found at 
nucleotide positions 598-600 (Figure 285; SEQ ID NO:400). The predicted polypeptide precursor (Figure 286, 
SEQ ID NO:401) is 198 amino acids long. The signal peptide is at about amino acids 1-21 of SEQ ID NO:401. 
5 PR01185 has a calculated molecular weight of approximately 22,105 daltons and an estimated pi of 
approximately 7.73. Clone DNA62881-1515 has been deposited with the ATCC and is assigned ATCC deposit 
no. 203096. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 286 (SEQ ID NO:401), revealed some sequence 
10 identity between the PR01185 amino acid sequence and the following Dayhoff sequences: TUP1_YEAST, 
AF041382J, MAOM_SOLTU. SPPBPHU9J.I41024, EPCPLCFAILJ, HSPLECJ, YKL4_CAEEL, 
A44643, TGU65922J. 

EXAMPLE 129 : Isolation of cDNA clones Encoding Human PRO 1345 
15 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This consensus sequence is herein designated DNA47364. Based on the DNA47364 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO 1345 

20 PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5 '-CCTGGTTATCCCCAGGAACTCCG AC-3 ' (SEQ ID NO:404) 

reverse PCR primer 5'-CTCTTGCTGCTGCGACAGGCCTC-3' (SEQ ID NO:405) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA47364 

sequence which had the following nucleotide sequence 

25 hybridization probe 

5 '-CGCCCTCC AAGACTATGGT AAA AGGAGCCTGCC AGGTGTCA ATGAC-3 ' (SEQ ID NO:406) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 

isolate clones encoding the PRO 1345 gene using the probe oligonucleotide and one of the PCR primers. RNA 
30 for construction of the cDNA libraries was isolated from human breast carcinoma tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PRO 1345 (designated herein as DNA64852-1589 [Figure 287, SEQ ID NO:402]) and the derived protein 

sequence for PRO 1345. 

The entire nucleotide sequence of DNA64852-1589 is shown in Figure 287 (SEQ ID NO:402). Clone 
35 DNA64852-1589 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 7-9 or 34-36 and ending at the stop codon at nucleotide positions 625-627 (Figure 287). The predicted 
polypeptide precursor is 206 amino acids long (Figure 288). The full-length PR01345 protein shown in Figure 
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288 has an estimated molecular weight of about 23, 190 daltons and a pi of about 9.40. Analysis of the full- 
length PRO 1345 sequence shown in Figure 288 (SEQ ID NO:403) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 3 1 or from about amino acid 10 to about amino acid 
31 and a C-type lectin domain signature sequence from about amino acid 176 to aboui amino acid 190. Clone 
DNA64852-1589 has been deposited with ATCC on August 18, 1998 and is assigned ATCC deposit no. 203127. 
5 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 288 (SEQ ID NO:403) f evidenced significant 
homology between the PRO 1345 amino acid sequence and the following Dayhoff sequences: BTU22298 1, 
TETN_CARSP, TETNHUMAN, MABA_RAT, S34198, P_W13144, MACMBPAJ, A46274, PSPD_RAT 
AND P_R32188. 

10 

FX AMPLE 130 : Isolation of cDNA clones Encoding Human PRO 1245 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the lncyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 

15 EST DNA database (LIFESEQ* lncyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et aL, Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

20 obtained therefrom is herein designated DNA56019. 

In light of an observed sequence homology between the DNA56019 consensus sequence and an EST 
sequence encompassed within the lncyte EST clone no. 1327836, the lncyte EST clone 1327836 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 289 and is herein designated as DNA64884^1527. 

25 The full length clone shown in Figure 289 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 79-81 and ending at the stop codon found at nucleotide positions 
391-393 (Figure 289; SEQ ID NO:407). The predicted polypeptide precursor (Figure 290, SEQ ID NO:408) 
is 104 amino acids long, with a signal peptide sequence at about amino acid 1 to about amino acid 18. 
PRO 1245 has a calculated molecular weight of approximately 10,100 daltons and an estimated pi of 

30 approximately 8.76. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 290 (SEQ ID NO:408), revealed some homology 
between the PR01245 amino acid sequence and the following Dayhoff sequences: SYA THETH, GEN1 1 167, 
MTV044_4, AB011151J, RLAJ2750_3, SNELIPTRA 1, S63624, C28391, A37907, and S14064. 

35 Clone DNA64884-1245 was deposited with the ATCC on August 25, 1998 and is assigned ATCC 

deposit no. 203155. 
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EXAMPLE 131 : Isolation of cDNA clones Encoding Human PRO 1358 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. t GenBank) and a proprietary 
EST DNA database (UFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or B LAS 17 (Altshul et ah, Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 88718, the Incyte EST clone 88718 was purchased and the cDNA 
insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The sequence 
of this cDNA insert is shown in Figure 291 and is herein designated as DNA64890-16I2. 

The full length clone shown in Figure 291 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 86 through 88 and ending at the stop codon found at nucleotide 
positions 1418 through 1420 (Figure 291; SEQ ID NO:409). The predicted polypeptide precursor (Figure 292, 
SEQ ID NO:410) is 444 amino acids long. The signal peptide is at about amino acids 1-18 of SEQ ID NO:410. 
PRO 1358 has a calculated molecular weight of approximately 50,719 daltons and an estimated pi of 
approximately 8.82. Clone DNA64890-1612 was deposited with the ATCC on August 18, 1998 and is assigned 
ATCC deposit no. 203131. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 292 (SEQ ID NO:410), revealed sequence identity 
between the PRO 1358 amino acid sequence and the following Dayhoff sequences: P W07607, AB000545_1, 
AB0O0546_l , A1ATJRAT, AB015 164 J , P_P5002 1 , COTRCA VPO, and H AMHPPi . The variants claimed 
in this application exclude these sequences. 

EXAMPLE 132 : Isolation of cDNA clones Encoding Human PROl 195 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. . GenBank) and a proprietary 
EST DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et ah, Methods in 
EnzvmologY 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA557I6. 

In light of an observed sequence homology between the DNA55716 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3252980, the Incyte EST clone 3252980 was purchased 
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and the cDNA insert was obtained and sequenced, It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 293 and is herein designated as DNA65412-1523. 

The full length clone shown in Figure 293 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 58-60 and ending at the stop codon found at nucleotide positions 
511-513 (Figure 293; SEQ ID NO:411). The predicted polypeptide precursor (Figure 294, SEQ ID NO:4I2) 
5 is 151 amino acids long. The signal sequence is at about amino acids 1-22 of SEQ ID NO:412. PRO 1195 has 
a calculated molecular weight of approximately 17,277 daltons and an estimated pi of approximately 5.33. Clone 
DNA65412-1523 was deposited with the ATCC on August 4, 1998 and is assigned ATCC deposit no. 203094. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 294 (SEQ ID NO:412), revealed some sequence 
10 identity between the PRO 1195 amino acid sequence and the following Dayhoff sequences: MMU28486_1, 
AF044205_1, P_W31186, CELK03C7J, F69034, EF1AMETVA, AF024540_1, SSU90353J, 
MRSP_STAAU and P_R97680. 

EXAMPLE 133 : Isolation of cDNA clones Encodine Human PRO1270 

15 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 

20 Enzvmology 266:460-480 ( 1 996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57951. 

In light of an observed sequence homology between the DNA57951 consensus sequence and an EST 

25 sequence encompassed within the Merck EST clone no. 124878, the Merck EST clone 124878 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-lehgth protein. 
The sequence of this cDNA insert is shown in Figure 295 and is herein designated as DNA66308-1537. 

Clone DNA66308-1537 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 103-105 and ending at the stop codon at nucleotide positions 1042-1044 (Figure 295). 

30 The predicted polypeptide precursor is 313 amino acids long (Figure 296). The full-length PRO1270 protein 
shown in Figure 296 has an estimated molecular weight of about 34,978 daltons and a pi of about 5.71 . Analysis 
of the full-length PRO 1270 sequence shown in Figure 296 (SEQ ID NO:414) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 16, a potential N-glycosylation site from 
about amino acid 163 to about amino acid 166 and glycosaminoglycan attachment sites from about amino acid 

35 74 to about amino acid 77 and from about amino acid 289 to about amino acid 292. Clone DNA66308-1537 has 
been deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 203159. 
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An analysis of ihe Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 296 (SEQ ID NO:414), evidenced significant 
homology between the PRO1270 amino acid sequence and the following Dayhoff sequences: XLU86699_I, 
S49589,FIBA_PARPA f FIBB_HUMAN f P_R47l89 t AF0O4326_l,DRTENASCN_KAF004327_l,P_W014n 

and FIBG_B0V1N. 

5 

EXAMPLE 134 : Isolation of cDNA clones Encoding Human PRO 1271 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 

10 EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et ah, Methods in 
Enzvmoloev 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program u phrap w (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

15 obtained therefrom is herein designated DNA57955. 

In light of an observed sequence homology between the DNA57955 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA625350, the Merck EST clone AA625350 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 
protein. The sequence of this cDNA insert is shown in Figure 297 and is herein designated as DNA66309-1538. 

20 Clone DNA66309-1538 contains a single open reading frame with an apparent translauonal initiation 

site at nucleotide positions 94-96 and ending at the stop codon at nucleotide positions 718-720 (Figure 297). The 
predicted polypeptide precursor is 208 amino acids long (Figure 298). The full-length PR01271 protein shown 
in Figure 298 has an estimated molecular weight of about 21,531 daltons and a pi of about 8.99. Analysis of 
the full-length PR01271 sequence shown in Figure 298 (SEQ ID NO:416) evidences the presence of the 

25 following: a signal peptide from about amino acid 1 to about amino acid 31 and a transmembrane domain from 
about amino acid 166 to about amino acid 187. Clone DNA66309-1538 has been deposited with ATCC on 
September 15, 1998 and is assigned ATCC deposit no. 203235. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 298 (SEQ ID NO:416), evidenced significant 

30 homology between the PR01271 amino acid sequence and the following Dayhoff sequences: S57180, S63257, 
AGA1_YEAST. BPU43599J, YS8A CAEEL, S67570, LSU54556_2, S70305, VGLX_HSVEB, and 
D88733J. 

EXAMPLE 135 : Isolation of cDNA clones Encoding Human PRQ1375 
35 A Merck/Wash. U. database was searched and a Merck EST was identified. This sequence was then 

put in a program which aligns it with other seequences from the Swiss-Prot public database, public EST 
databases (e.g., GenBank, Merck/Wash. U.), and a proprietary EST database (LIFESEQ* Incyte 
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Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program BLAST or BLAST2 
[Altschul et ah, Methods in EnzYmology. 266:460^80 (1996)] as a comparison of the extracellular domain 
(ECD) protein sequences to a 6 frame translation of the EST sequences. Those comparisons resulting in a 
BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were clustered and 
assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, 
5 Seattle, Washington). 

A consensus DNA sequence was assembled relative to other EST sequences using phrap. This 
consensus sequence is designated herein "DNA67003". 

Based on theDNA67003 consensus sequence, the nucleic acid (SEQ ID NO:417) was identified in a 
human pancreas library. DNA sequencing of the clone gave the full-length DNA sequence for PR01375 and 
10 the derived protein sequence for PRO 1375. 

The entire coding sequence of PR01375 is shown in Figure 299 (SEQ ID NO:4 17). Clone DNA67004- 
1614 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
104-106 and an apparent stop codon at nucleotide positions 698-700 of SEQ ID NO:417. The predicted 
polypeptide precursor is 198 amino acids long. The transmembrane domains are at about amino acids 1 1-28 
15 (type II) and 103-125 of SEQ ID NO:418. Clone DNA67004-I614 has been deposited with ATCC and is 
assigned ATCC deposit no. 203115. The full-length PRO 1375 protein shown in Figure 300 has an estimated 
molecular weight of about 22,531 daltons and a pi of about 8.47. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 300 (SEQ ID NO:4 1 8), revealed sequence identity 
20 between the PRO 1375 amino acid sequence and the following Dayhoff sequences: AF026198_5, CELR12C12_5, 
S73465, Y011_MYCPN, S64538J, P_P8150. MUVSHPO10J, VSH_MUMPL and CVU59751_5. 

EXAMPLE 136 : Isolation of cDNA clones Encoding Human PRO 1385 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

25 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ*. Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et a]., Methods in 
Enzvmology 266:460-480 (19%)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

30 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57952. 

In light of an observed sequence homology between the DNA57952 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3129630, the Incyte EST clone 3129630 was purchased 

35 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 301 and is herein designated as DNA68869-1610. 
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Clone DNA68869-1610 contains a single open reading frame with an apparent translation^ initiation 
site at nucleotide positions 26-28 and ending at the stop codon at nucleotide positions 4 10-412 (Figure 301). The 
predicted polypeptide precursor is 128 amino acids long (Figure 302). The full-length PRO 1385 protein shown 
in Figure 302 has an estimated molecular weight of about 13,663 daltons and a pi of about 10.97. Analysis of 
the full-length PR01385 sequence shown in Figure 302 (SEQ ID NO:420) evidences the presence of the 
5 following: a signal peptide from about amino acid 1 to about amino acid 28, and glycosylaminoglycan attachment 
sites from about amino acid 82 to about amino acid 85 and from about amino acid 91 to about amino acid 94. 
Clone DNA68869-1610 has been deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 
203164. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
10 alignment analysis of the full-length sequence shown in Figure 302 (SEQ ID NO:420), evidenced low homology 
between the PR01385 amino acid sequence and the following Dayhoff sequences: CELT14A8J, 
LMNACHRAll, HXD9_HUMAN, CHKCMUM, HS5PP34 2, DMDRINGJ, A37107J, 
MMLUNGENEl, PUM DROME and DMU251171. 

15 EXAMPLE 137 : Isolation of cDNA clones Encoding Human PRO 1387 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

20 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzvmoiogv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56259. 

25 In light of an observed sequence homology between the DNA56259 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3507924, the Incyte EST clone 3507924 was purchased 
and the c DNA insert was obtained and sequenced. It was founo) that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 303 and is herein designated as DNA68872-1620. 

Clone DNA68872-1620 contains a single open reading frame with an apparent translational initiation 

30 site at nucleotide positions 85-87 and ending at the stop codon at nucleotide positions 1267-1269 (Figure 303). 
The predicted polypeptide precursor is 394 amino acids long (Figure 304). The full-length PRO 1387 protein 
shown in Figure 304 has an estimated molecular weight of about 44,339 daltons and a pi of about 7.10. Analysis 
of the full-length PRO 13 87 sequence shown in Figure 304 (SEQ ID NO:422) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 19, a transmembrane domain from about 

3 5 amino acid 275 to about amino acid 296, potential N-glycosylation sites from about amino acid 76 to about amino 
acid 79, from about amino acid 231 to about amino acid 234, from about amino acid 302 to about amino acid 
305, from about amino acid 307 to about amino acid 310 and from about amino acid 376 to about amino acid 
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379, and amino acid sequence blocks having homology to myelin pO protein from about amino acid 2 10 to about 
amino acid 239 and from about amino acid 92 to about amino acid 121. Clone DNA68872-1620 has been 
deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 203160. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 304 (SEQ ID NO:422), evidenced significant 
homology between the PR01387 amino acid sequence and the following Dayhoff sequences: P W36955, 
MYPOHETFR, HS46KDAJ, AF049498J, MYO0_HUMAN. AF030454J, A53268, SHPTCRAJ, 
P W14146 and GEN 12838. 



FX AMPLE 138 : Isolation of cDNA clones E ncoding Human PRO 1384 
10 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This consensus sequence is herein designated DNA54192. Based on the DNA54192 

sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 

of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO 1384. 
FCR primers (forward and reverse) were synthesized: 
15 forward PCR primer 5 ' -TGC AGCCCCTGTG AC AC A AACTGG -3 * (SEQ ID NO:425) 

reverse PCR primer 5-CTGAGATAACCGAGCCATCCTCCCAC-3' (SEQ ID NO:426) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA54192 

sequence which had the following nucleotide sequence: 

h/bridization probe 

20 5 ' -GG AG AT AGCTGCT ATGGGTTCTTC AGGC AC AACTT A AC ATGGG AAG-3 ' (SEQ ID NO:427) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 

isolate clones encoding the PRO 1384 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated from human fetal liver. 
25 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR01384 (designated herein as DNA71 159-1617 [Figure 305, SEQ ID NO:423J; and the derived protein 

sequence for PRO 1384. 

The entire coding sequence of PR01384 is shown in Figure 305 (SEQ ID NO:423). Clone DNA7 1 159- 
1617 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 

30 1 82- 1 84 and an apparent stop codon at nucleotide positions 869-87 1 . The predicted polypeptide precursor is 229 
amino acids long. The full-length PRO 1384 protein shown in Figure 306 has an estimated molecular weight of 
about 26,650 daltons and a pi of about 8.76. Additional features include a type II transmembrane domain at 
about amino acids 32-57, and potential N-glycosylation sites at about amino acids 68-71 , 120-123, and 134-137. 
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

35 alignment analysis of the full-length sequence shown in Figure 306 (SEQ ID NO:424), revealed homology 
between the PR01384 amino acid sequence and the following Dayhoff sequences: AF054819_1 , HSAJ1687_1 , 
AF009511_1,AB010710_1,GEN13595,HSAJ673_1 ( GEN13961.AB005900_1,LECH_CHICK,AF021349_1, 
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and NK13RAT. 

Clone DNA71 159-1617 has been deposited with ATCC and is assigned ATCC deposit no. 203135. 

EXAMPLE 139 : Use of PRO as a hybridization probe 

The following method describes use of a nucleotide sequence encoding PRO as a hybridization probe. 
5 DN A comprising the coding sequence of full-length or mature PRO as disclosed herein is employed as 

a probe to screen for homologous DN As (such as those encoding narurally-occurring variants of PRO) in human 
tissue cDNA libraries or human tissue genomic libraries. 

Hybridization and washing of filters containing either library DNAs is performed under the following 
high stringency conditions. Hybridization of radiolabeled PRO-derived probe to the filters is performed in a 
10 solution of 50% formamide, 5x SSC, 0. 1 % SDS, 0. 1 % sodium pyrophosphate, 50 mM sodium phosphate, pH 
6.8, 2x Denhardt's solution, and 10% dextran sulfate at 42°C for 20 hours. Washing of the filters is performed 
in an aqueous solution of 0. lx SSC and 0. 1 % SDS at 42°C. 

DNAs having a desired sequence identity with the DNA encoding full-length native sequence PRO can 
then be identified using standard techniques known in the an. 

15 

EXAMPLE 140 : Expression of PRO in E. coli 

This example illustrates preparation of an unglycosylated form of PRO by recombinant expression in 

E. coli. 

The DNA sequence encoding PRO is initially amplified using selected PCR primers. The primers 
20 should contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected 
expression vector. A variety of expression vectors may be employed. An example of a suitable vector is 
pBR322 (derived from E. coli; see Bolivar et al., Gene, 2:95 (1977)) which contains genes for ampicillin and 
tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated. The PCR 
amplified sequences are then ligated into the vector. The vector will preferably include sequences which encode 
25 for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six STII codons, polyhis 
sequence, and enterokinase cleavage site), the PRO coding region, lambda transcriptional terminator, and an 
argU gene. 

The ligation mixture is then used to transform a selected E. coli strain using the methods described in 
Sambrook et al. , supra . Transformants are identified by their ability to grow on LB plates and antibiotic resistant 
30 colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and DNA 
sequencing. 

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with 
antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells are 
then grown to a desired optical density, during which the expression promoter is turned on. 
35 After culturing the cells for several more hours, the cells can be harvested by centrifugation. The cell 

pellet obtained by the centrifugation can be solubilized using various agents known in the an, and the solubilized 
PRO protein can then be purified using a metal chelating column under conditions that allow tight binding of the 
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protein. 

PRO may be expressed in E. coli in a poly-His tagged form, using the following procedure. The DNA 
encoding PRO is initially amplified using selected PCR primers. The primers will contain restriction enzyme 
sites which correspond to the restriction enzyme sites on the selected expression vector, and other useful 
sequences providing for efficient and reliable translation initiation, rapid purification on a metal chelation 
5 column, and proteolytic removal with enterokinase. The PCR-amplified, poly-His tagged sequences are then 
ligated into an expression vector, which is used to transform an £. coli host based on strain 52 (W3110 
fuhA(tonA) Ion galE rpoHts(htpRts) clpPOadq). Trans fonnants are first grown in LB containing 50 mg/ml 
carbenicillin at 30° C with shaking until an O.D.600 of 3-5 is reached. Cultures are then diluted 50-100 fold into 
CRAP media (prepared by mixing 3.57 g (NH 4 ) 2 S0 4 , 0.71 g sodium citrate»2H20, 1.07 g KC1, 5.36 g Difco 
10 yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as 1 10 mM MPOS, pH 7.3, 0.55% (w/v) 
glucose and 7 mM MgSOJ and grown for approximately 20-30 hours at 30 °C with shaking. Samples are 
removed to verify expression by SDS-PAGE analysis, and the bulk culture is centrifuged to pellet the cells. Cell 
pellets are frozen until purification and refolding. 

E. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes (w/v) in 7 M 
15 guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate is added to make final 
concentrations of 0. 1M and 0.02 M, respectively, and the solution is stirred overnight at 4°C. This step results 
in a denatured protein with all cysteine residues blocked by sulfnolization. The solution is centrifuged at 40,000 
rpm in a Beckman Ultracentifuge for 30 min. The supernatant is diluted with 3-5 volumes of metal chelate 
column buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify. The 
20 clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the metal chelate 
column buffer. The column is washed with additional buffer containing 50 mM imidazole (Calbiochem, Utrol 
grade), pH 7.4. The protein is eluted with buffer containing 250 mM imidazole. Fractions containing the 
desired protein are pooled and stored at 4°C. Protein concentration is estimated by its absorbance at 280 nm 
using the calculated extinction coefficient based on its amino acid sequence. 
25 The proteins are refolded by diluting the sample slowly into freshly prepared refolding buffer consisting 

of: 20 mM Tris, pH 8.6, 0.3 M NaCI, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM EDTA. 
Refolding volumes are chosen so that the final protein concentration is between 50 to 100 micrograms/rril. The 
refolding solution is stirred gently at 4°C for 12-36 hours. The refolding reaction is quenched by the addition 
of TFA to a final concentration of 0.4 % (pH of approximately 3). Before further purification of the protein, the 
30 solution is filtered through a 0.22 micron filter and acetonitrile is added to 2-10% final concentration. The 
refolded protein is chromatographed on a Poros Rl/H reversed phase column using a mobile buffer of 0.1 % 
TFA with elution with a gradient of acetonitrile from 10 to 80%. Aliquots of fractions with A280 absorbance 
are analyzed on SDS polyacrylamide gels and fractions containing homogeneous refolded protein are pooled. 
Generally, the properly refolded species of most proteins are etuted at the lowest concentrations of acetonitrile 
35 since those species are the most compact with their hydrophobic interiors shielded from interaction with the 
reversed phase resin. Aggregated species are usually eluted at higher acetonitrile concentrations. In addition 
to resolving misfolded forms of proteins from the desired form, the reversed phase step also removes endotoxin 
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from the samples. 

Fractions containing the desired folded PRO polypeptide are pooled and the acetonitrile removed using 
a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 6.8 with 
0. 14 M sodium chloride and 4% mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia) resins 
equilibrated in the formulation buffer and sterile filtered. 
5 Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

FX AMPLE 141 : Expression of PRO in mam malian cells 

This example illustrates preparation of a potentially glycosylated form of PRO by recombinant 
expression in mammalian cells. 
10 The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression vector. 

Optionally, the PRO DNA is ligated into pRK5 with selected restriction enzymes to allow insertion of the PRO 
DNA using ligation methods such as described in Sambrook et al., supra . The resulting vector is called pRK5- 
PRO. 

In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are 

15 grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf serum and 
optionally, nutrient components and/or antibiotics. About 10 fig pRK5-PRO DNA is mixed with about 1 fig 
DNA encoding the VA RNA gene rThimmappaya et al., Cell, 31:543 (1982)) and dissolved in 500 /d of 1 mM 
Tris-HCl, 0.1 mM EDTA, 0.227 M CaCU. To this mixture is added, dropwise, 500 fi\ of 50 mM HEPES (pH 
7.35), 280 mM NaCl, 1.5 mM NaP0 4 , and a precipitate is allowed to form for 10 minutes at 25°C. The 

20 precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at 37°C. The 
culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are 
then washed with serum free medium, fresh medium is added and the cells are incubated for about 5 days. 

Approximately 24 hours after the transfections, the culture medium is removed and replaced with culture 
medium (alone) or culture medium containing 200 fiCi/ml 3i S-cysteine and 200 fiCi/ml "S-methionine. After 

25 a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto a 15% 
SDS gel. The processed gel may be dried and exposed to film for a selected period of time to reveal the 
presence of PRO polypeptide. The cultures containing transfected cells may undergo further incubation (in 
serum free medium) and the medium is tested in selected bioassays. 

In an alternative technique, PRO may be introduced into 293 cells transiently using the dextran sulfate 

30 method described by Somparyrac et al., Proc. Natl. Acad. Sci. . 12:7575 (1981). 293 cells are grown to 
maximal density in a spinner flask and 700 /ig pRK5-PRO DNA is added. The cells are first concentrated from 
the spinner flask by centrifugation and washed with PBS. The DN A-dextran precipitate is incubated on the cell 
pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue culture 
medium, and re-introduced into the spinner flask containing tissue culture medium. 5 (ig/m\ bovine insulin and 

35 0. 1 fig/ml bovine transferrin. After about four days, the conditioned media is centrifuged and filtered to remove 
cells and debris. The sample containing expressed PRO can then be concentrated and purified by any selected 
method, such as dialysis and/or column chromatography. 
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In another embodiment, PRO can be expressed in CHO cells. The pRK5-PRO can be transfected into 
CHO cells using known reagents such as CaP0 4 or DEAE-dextran. As described above, the cell cultures can 
be incubated, and the medium replaced with culture medium (alone) or medium containing a radiolabel such as 
3s S-methionine. After determining the presence of PRO polypeptide, the culture medium may be replaced with 
serum free medium. Preferably, the cultures are incubated for about 6 days, and then the conditioned medium 
5 is harvested. The medium containing the expressed PRO can then be concentrated and purified by any selected 
method. 

Epitope-tagged PRO may also be expressed in host CHO cells. The PRO may be subcloned out of the 
pRK5 vector. The subclone insert can undergo PCR to fuse in frame with a selected epitope tag such as a poly- 
his tag into a Baculovirus expression vector. The poly-his tagged PRO insert can then be subcloned into a SV40 
1 0 driven vector containing a selection marker such as DHFR for selection of stable clones. Finally, the CHO cells 
can be transfected (as described above) with the SV40 driven vector. Labeling may be performed, as described 
above, to verify expression. The culture medium containing the expressed poly-His tagged PRO can then be 
concentrated and purified by any selected method, such as by Ni ;+ -chelate affinity chromatography. 

PRO may also be expressed in CHO and/or COS cells by a transient expression procedure or in CHO 
15 cells by another stable expression procedure. 

Stable expression in CHO cells is performed using the following procedure. The proteins are expressed 
as an IgG construct (immunoadhesin), in which the coding sequences for the soluble forms (e.g. extracellular 
domains) of the respective proteins are fused to an IgGl constant region sequence containing the hinge, CH2 and 
CH2 domains and/or is a poly-His tagged form. 
20 Following PCR amplification, the respective DNAs are subcloned in a CHO expression vector using 

standard techniques as described in Ausubel et ah, Current Protocols of Molecular Biology. Unit 3.16, John 
Wiley and Sons (1997). CHO expression vectors are constructed to have compatible restriction sites 5' and 3' 
of the DNA of interest to allow the convenient shuttling of cDNA's. The vector used expression in CHO cells 
is as described in Lucas et al. f Nuci. Acids Res. 24:9 (1774-1779 (1996), and uses the SV40 early 
25 promoter/enhancer to drive expression of the cDNA of interest and dihydrofolate reductase (DHFR). DHFR 
expression permits selection for stable maintenance of the plasmid following transfection. 

Twelve micrograms of the desired plasmid DNA is introduced into approximately 10 million CHO cells 
using commercially available transfection reagents Superfect* (Quiagen), Dosper* or Fugene* (Bochringer 
Mannheim). The cells are grown as described in Lucas et al., supra . Approximately 3 x 10' 7 cells are frozen 
30 in an ampule for further growth and production as described below. 

The ampules containing the plasmid DNA are thawed by placement into water bath and mixed by 
vonexing. The contents are pipetted into a centrifuge tube containing 10 mLs of media and centrifuged at 1000 
rpm for 5 minutes. The supernatant is aspirated and the cells are resuspended in 10 mL of selective media (0.2 
^m filtered PS20 with 5% 0.2 A<m diafiltered fetal bovine serum). The cells are then aliquoted into a 100 mL 
35 spinner containing 90 mL of selective media. After 1-2 days, the cells are transferred into a 250 mL spinner 
filled with 150 mL selective growth medium and incubated at 37?C. After another 2-3 days, 250 mL, 500 mL 
and 2000 mL spinners are seeded with 3 x 10 s cells/mL. The cell media is exchanged with fresh media by 
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centrifugation and resuspension in produciion medium. Although any suitable CHO media may be employed, 
a production medium described in U.S. Patent No. 5,122,469, issued June 16, 1992 may actually be used. A 
3L production spinner is seeded at 1 .2 x 10 6 cells/mL. On day 0, the cell number pH ie determined. On day 
1, the spinner is sampled and sparging with filtered air is commenced. On day 2, the spinner is sampled, the 
temperature shifted to 33°C, and 30 mL of 500 g/L glucose and 0.6 mL of 10% antifoam (e.g., 35% 
5 polydimethylsiloxane emulsion, Dow Corning 365 Medical Grade Emulsion) taken. Throughout the production, 
the pH is adjusted as necessary to keep it at around 7.2. After 10 days, or until the viability dropped below 
70 % , the cell culture is harvested by centrifugation and filtering through a 0.22 ^m filter. The filtrate was either 
stored at 4°C or immediately loaded onto columns for purification. 

For the poly-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). Before 

10 purification, imidazole is added to the conditioned media to a concentration of 5 mM. The conditioned media 
is pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4, buffer containing 0.3 M NaCl 
and 5 mM imidazole at a flow rate of 4-5 ml/min. at 4°C. After loading, the column is washed with additional 
equilibration buffer and the protein eluted with equilibration buffer containing 0.25 M imidazole. The highly 
purified protein is subsequently desalted into a storage buffer containing 10 mM Hepes, 0.14 M NaCl and 4% 

15 mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and stored at -80°C. 

Immunoadhesin (Fc-containing) constructs are purified from the conditioned media as follows. The 
conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 20 
mM Na phosphate buffer, pH 6.8. After loading, the column is washed extensively with equilibration buffer 
before elution with 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by collecting 1 

20 ml fractions into tubes containing 275 (XL of 1 M Tris buffer, pH 9. The highly purified protein is subsequently 
desalted into storage buffer as described above for the poly-His tagged proteins. The homogeneity is assessed 
by SDS polyacrylamide gels and by N-terminal amino acid sequencing by Edraan degradation. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

25 EXAMPLE 142 : Expression of PRO in Yeast 

The following method describes recombinant expression of PRO in yeast. 

First, yeast expression vectors are constructed for intracellular production or secretion of PRO from 
the ADH2/GAPDH promoter. DN A encoding PRO and the promoter is inserted into suitable restriction enzyme 
sites in the selected plasmid to direct intracellular expression of PRO. For secretion, DNA encoding PRO can 

30 be cloned into the selected plasmid, together with DNA encoding the ADH2/GAPDH promoter, a native PRO 
signal peptide or other mammalian signal peptide, or, for example, a yeast alpha-factor or invertase secretory 
signal/leader sequence, and linker sequences (if needed) for expression of PRO. 

Yeast cells, such as yeast strain ABl 10, can then be transformed with the expression plasmids described 
above and cultured in selected fermentation media. The transformed yeast supematants can be analyzed by 

35 precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining of the gels with 
Coomassie Blue stain. 
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Recombinant PRO can subsequently be isolated and purified by removing the yeast cells from the 
fermentation medium by centrifugation and then concentrating the medium using selected cartridge filters. The 
concentrate containing PRO may further be purified using selected column chromatography resins. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 



5 E XAMPLE 143 : Expression of PRO in Baculovirus-Infected Insect Cells 

The following method describes recombinant expression of PRO in Baculovirus-infected insect cells. 
The sequence coding for PRO is fused upstream of an epitope tag contained within a baculovirus 
expression vector. Such epitope tags include poly -his tags and immunoglobulin tags (like Fc regions of IgG). 
A variety of plasmids may be employed, including plasmids derived from commercially available plasmids such 

10 as pVL1393 (Novagen). Briefly, the sequence encoding PRO or the desired portion of the coding sequence of 
PRO such as the sequence encoding the extracellular domain of a transmembrane protein or the sequence 
encoding the mature protein if the protein is extracellular is amplified by PCR with primers complementary to 
the 5' and 3' regions. The 5* primer may incorporate flanking (selected) restriction enzyme sites. The product 
is then digested with those selected restriction enzymes and subcloned into the expression vector. 

15 Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGold™ virus 

DNA (Phanningen) into Spodoptera frugiperda ("Sf9") cells (ATCC CRL 171 1) using lipofeciin (commercially 
available from GIBCO-BRL). After 4 - 5 days of incubation at 28°C, the released viruses are harvested and used 
for further amplifications. Viral infection and protein expression are performed as described by O'Reilley et 
al., Baculovirus expression vectors: A Laboratory Manual. Oxford: Oxford University Press (1994). 

20 Expressed poly-his tagged PRO can then be purified, for example, by Ni 2+ -chelate affinity 

chromatography as follows. Extracts are prepared from recombinant virus-infected Sf9 cells as described by 
Rupert et al., Nature . 362:175-179 (1993). Briefly, Sf9 cells are washed, resuspended in sonication buffer (25 
mL Hepes, pH 7.9; 12.5 mM MgCl 2 ; 0. 1 raM EDTA; 10% glycerol; 0.1% NP-40; 0.4 M KCI), and sonicated 
twice for 20 seconds on ice. The sonicates are cleared by centrifugation, and the supernatant is diluted 50-fold 

25 in loading buffer (50 mM phosphate, 300 mM NaCi, 10% glycerol, pH 7.8) and filtered through a 0.45 /im 
filter. A Ni :+ -NTA agarose column (commercially available from Qiagen) is prepared with a bed volume of 5 
mL, washed with 25 mL of water and equilibrated with 25 mL of loading buffer. The filtered cell extract is 
loaded onto the column at 0.5 mL per minute. The column is washed to baseline A 280 with loading buffer, at 
which point fraction collection is started. Next, the column is washed with a secondary wash buffer (50 mM 

30 phosphate; 300 mM NaCl, 10% glycerol, pH 6.0), which elutes nonspecifically bound protein. After reaching 
A 2W baseline again, the column is developed with a 0 to 500 mM Imidazole gradient in the secondary wash 
buffer. One mL fractions are collected and analyzed by SDS-PAGE and silver staining or Western blot with 
Ni 2+ -NTA-conjugated to alkaline phosphatase (Qiagen). Fractions containing the elutcd His 10 -tagged PRO are 
pooled and dialyzed against loading buffer. 

35 Alternatively, purification of the IgG tagged (or Fc tagged) PRO can be performed using known 

chromatography techniques, including for instance. Protein A or protein G column chromatography. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 
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FX AMPLE 144 : Preparation of Antibodies that Bind PRO 

This example illustrates preparation of monoclonal antibodies which can specifically bind PRO. 

Techniques for producing the monoclonal antibodies are known in the art and are described, for 
instance, in Goding, supra . Immunogens that may be employed include purified PRO, fusion proteins containing 
PRO, and cells expressing recombinant PRO on the cell surface. Selection of the immunogen can be made by 
5 the skilled artisan without undue experimentation. 

Mice, such as Balb/c, are immunized with the PRO immunogen emulsified in complete Freund's 
adjuvant and injected subcutaneously or intraperitoneally in an amount from 1-100 micrograms. Alternatively, 
the immunogen is emulsified in MPL-TDM adjuvant (Ribi Immunochemical Research, Hamilton, MT) and 
injected into the animal's hind foot pads. The immunized mice are then boosted 10 to 12 days later with 
10 additional immunogen emulsified in the selected adjuvant. Thereafter, for several weeks, the mice may also be 
boosted with additional immunization injections. Serum samples may be periodically obtained from the mice 
by retro-orbital bleeding for testing in ELISA assays to detect ami-PRO antibodies. 

After a suitable antibody titer has been detected, the animals •positive" for antibodies can be injected 
with a final intravenous injection of PRO. Three to four days later, the mice are sacrificed and the spleen cells 
15 are harvested. The spleen cells are then fused (using 35% polyethylene glycol) to a selected murine myeloma 
cell line such as P3X63AgU.l, available from ATCC, No. CRL 1597. The fusions generate hybridoma cells 
which can then be plated in 96 well tissue culture plates containing HAT (hypoxanthine, aminopterin, and 
thymidine) medium to inhibit proliferation of non-fused cells, myeloma hybrids, and spleen cell hybrids. 

The hybridoma cells will be screened in an ELISA for reactivity against PRO. Determination of 
20 "positive* hybridoma cells secreting the desired monoclonal antibodies against PRO is within the skill in the art. 

The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c mice to produce 
ascites containing the anti-PRO monoclonal antibodies. Alternatively, the hybridoma cells can be grown in tissue 
culture flasks or roller bottles. Purification of the monoclonal antibodies produced in the ascites can be 
accomplished using ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively, 
25 affinity chromatography based upon binding of antibody to protein A or protein G can be employed. 

EXAMPLE 145 : Purification of PRO Polypeptides Using Specific Antibodies 

Native or recombinant PRO polypeptides may be purified by a variety of standard techniques in the art 

of protein purification. For example, pro-PRO polypeptide, mature PRO polypeptide, or pre-PRO polypeptide 
30 is purified by immunoaffmity chromatography using antibodies specific for the PRO polypeptide of interest. In 

general, an immunoaffmity column is constructed by covalently coupling the anti-PRO polypeptide antibody to 

an activated chromatographic resin. 

Polyclonal immunoglobulins are prepared from immune sera either by precipitation with ammonium 

sulfate or by purification on immobilized Protein A (Pharmacia LKB Biotechnology, Piscataway, N.J.). 
35 Likewise, monoclonal antibodies are prepared from mouse ascites fluid by ammonium sulfate precipitation or 

chromatography on immobilized Protein A. Partially purified immunoglobulin is covalently attached to a 

chromatographic resin such as CnBr-activated SEPHAROSE™ (Pharmacia LKB Biotechnology). The antibody 
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is coupled to the resin, the resin is blocked, and the derivative resin is washed according to the manufacturer's 
instructions. 

Such an immuno affinity column is utilized in the purification of PRO polypeptide by preparing a fraction 
from cells containing PRO polypeptide in a soluble form. This preparation is derived by solubilization of the 
whole cell or of a subcellular fraction obtained via differential centrifugation by the addition of detergent or by 
5 other methods well known in the art. Alternatively, soluble PRO polypeptide containing a signal sequence may 
be secreted in useful quantity into the medium in which the cells are grown. 

A soluble PRO polypepn^e^ontaining preparation is passed over the immunoaffiniry column, and the 
column is washed under conditions that allow the preferential absorbance of PRO polypeptide (e.g., high ionic 
strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt 
10 antibody/PRO polypeptide binding (e. g. , a low pH buffer such as approximately pH 2-3 , or a high concentration 
of a chaotrope such as urea or thiocyanate ion), and PRO polypeptide is collected. 

EXAMPLE 146 : Drug Screening 

This invention is particularly useful for screening compounds by using PRO polypeptides or binding 

15 fragment thereof in any of a variety of drug screening techniques. The PRO polypeptide or fragment employed 
in such a test may either be free in solution, affixed to a solid support, borne on a cell surface, or located 
intracellularly. One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 
transformed with recombinant nucleic acids expressing the PRO polypeptide or fragment. Drugs are screened 
against such transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can be 

20 used for standard binding assays. One may measure, for example, the formation of complexes between PRO 
polypeptide or a fragment and the agent being tested. Alternatively, one can examine the diminution in complex 
formation between the PRO polypeptide and its target cell or target receptors caused by the agent being tested. 

Thus, the present invention provides methods of screening for drugs or any other agents which can 
affect a PRO polypeptide -associated disease or disorder. These methods comprise contacting such an agent with 

25 an PRO polypeptide or fragment thereof and assaying (1) for the presence of a complex between the agent and 
the PRO polypeptide or fragment, or (ii) for the presence of a complex between the PRO polypeptide or fragment 
and the cell, by methods well known in the art. In such competitive binding assays, the PRO polypeptide or 
fragment is typically labeled. After suitable incubation, free PRO polypeptide or fragment is separated from that 
present in bound form, and the amount of free or uncomplexed label is a measure of the ability of the particular 

30 agent to bind to PRO polypeptide or to interfere with the PRO polypeptide/cell complex. 

Another technique for drug screening provides high throughput screening for compounds having suitable 
binding affinity to a polypeptide and is described in detail in WO 84/03564, published on September 13, 1984. 
Briefly stated, large numbers of different small peptide test compounds are synthesized on a solid substrate, such 
as plastic pins or some other surface. As applied to a PRO polypeptide, the peptide test compounds are reacted 

35 with PRO polypeptide and washed. Bound PRO polypeptide is detected by methods well known in the an. 
Purified PRO polypeptide can also be coated directly onto plates for use in the aforementioned drug screening 
techniques. In addition, non-neutralizing antibodies can be used to capture the peptide and immobilize it on the 
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solid support. 

This invention also contemplates the use of competitive drug screening assays in which neutralizing 
antibodies capable of binding PRO polypeptide specifically compete with a test compound for binding to PRO 
polypeptide or fragments thereof. In this manner, the antibodies can be used to detect the presence of any 
peptide which shares one or more antigenic determinants with PRO polypeptide. 

5 

EXAMPLE 147 : Rational Drue Design 

The goal of rational drug design is to produce structural analogs of biologically active polypeptide of 
interest (i.e. , a PRO polypeptide) or of small molecules with which they interact, e.g. , agonists, antagonists, or 
inhibitors. Any of these examples can be used to fashion drugs which are more active or stable forms of the 

1 0 PRO polypeptide or which enhance or interfere with the function of the PRO polypeptide in vivo (c./ , Hodgson, 
BiofTechnoloev. 9: 19-21 (1991)). 

In one approach, the three-dimensional structure of the PRO polypeptide, or of an PRO 
polypeptide-inhibitor complex, is determined by x-ray crystallography, by computer modeling or, most typically, 
by a combination of the two approaches. Both the shape and charges of the PRO polypeptide must be ascertained 

15 to elucidate the structure and to determine active site(s) of the molecule. Less often, useful information regarding 
the structure of the PRO polypeptide may be gained by modeling based on the structure of homologous proteins. 
In both cases, relevant structural information is used to design analogous PRO polypeptide -like molecules or to 
identify efficient inhibitors. Useful examples of rational drug design may include molecules which have improved 
activity or stability as shown by Braxton and Wells, Biochemistry. 31:7796-7801 (1992) or which act as 

20 inhibitors, agonists, or antagonists of native peptides as shown by Athauda et a/., J. Biochem.. ]_L3* 74 2-746 
(1993). 

It is also possible to isolate a target-specific antibody, selected by functional assay, as described above, 
and then to solve its crystal structure. This approach, in principle, yields a pharmacore upon which subsequent 
drug design can be based. It is possible to bypass protein crystallography altogether by generating ami -idiotypic 

25 antibodies (anti-ids) to a functional, pharmacologically active antibody. As a mirror image of a mirror image, 
the binding site of the anti-ids would be expected to be an analog of the original receptor. The anii-id could then 
be used to identify and isolate peptides from banks of chemically or biologically produced peptides. The isolated 
peptides would then act as the pharmacore. 

By virtue of the present invention, sufficient amounts of the PRO polypeptide may be made available 

30 to perform such analytical studies as X-ray crystallography. In addition, knowledge of the PRO polypeptide 
amino acid sequence provided herein will provide guidance to those employing computer modeling techniques 
in place of or in addition to x-ray crystallography. 
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Deposit of Materia] 

The following materials have been deposited with the American Type Culture Collection, 10801 
University Blvd., Manassas, VA 20110-2209, USA (ATCC): 



Table 2 



15 



20 



25 



30 



35 



40 



Materia! 


ATCC Deo. No. 


Deposit D 


UNA 16422- 1209 




June 2, 1998 


DNA 16435- 1208 




June 2, 1998 


DNA2 1624- 1391 


209917 


June 2, 1998 


DNA23334-1392 


209918 


I ^ 1 Art A 

June 2, 1998 


DNA26288-1239 


209792 


A *1 T 1 1 AAA 

April 21, 1998 


DNA26843-1389 


203099 


August 4, 1998 


DNA26844-1394 


209926 


June 2, 1998 


DNA30862-1396 


209920 


June 2, 1998 


DNA35680-1212 


AAAf AA 

209790 


April 21, 1998 


DNA40621-1440 


209922 


June 2, 1998 


DNA44 161-1434 


209907 


May 27, 1998 


DNA44694-1500 


203114 


August 11, 1998 


DNA45495-1550 


203156 


August 25, 1998 


DNA47361-1154 


209431 


November 7, 1997 


DNA47394-1572 


203109 


August 11, 1998 


DNA48320-1433 


209904 


May 27, 1998 


DNA48334-1435 


209924 


June 2, 1998 


DNA48606-1479 


203040 


July 1, 1998 


DNA49141-1431 


203003 


June 23, 1998 


DNA49 142- 1430 


203002 


June 23, 1998 


DNA49 143- 1429 


203013 


June 23, 1998 


DNA49647-1398 


209919 


June 2. 1998 


DNA49819-1439 


A AAA'S « 

209931 


June 2, 1998 


DNA49820-1427 


209932 


June 2, 1998 


DNA4982 1-1562 


209981 


June 16, 1998 


DNA52192-1369 


A A^ ^\ J A 

203042 


July 1, 1998 


DNA52598-1518 


203107 


August 11, 1998 


DNA53913-1490 


203162 


August 25, 1998 


DNA53978-1443 


a aaaa* 1 * 

209983 


June 16, 1998 


DNA53996-I442 


209921 


June 2, 1998 


DNA56041-1416 


A A.1 A. * a 

203012 


June 23, 1998 


DN A56047- 1 456 


209948 


June 9, 1998 


iJvi ADOUjU- 14jj 


20301 1 


t a ^ 1 aaa 

June 23, 1998 


L)NA5ol 10-1437 


203113 


August ll, 1998 


HMA 11 mo 

UlNADoJ li-l j/o 


203049 


T 1 # 4 AAA 

July 1, 1998 


DNA56410-1414 


^AAA/^1 

209923 


June 2, 1998 


1J IN Ajo4 jo- 144 8 


209902 


May 27, 1998 


DNA56855-1447 


203004 


June 23, 1998 


DNA56859-1445 


203019 


June 23, 1998 


DNA56860-1510 


209952 


June 9, 1998 


DNA56865-1491 


203022 


June 23, 1998 


DNA56866-1342 


203023 


June 23, 1998 


DNA56868-1209 


203024 


June 23, 1998 


DNA56869-1545 


203161 


August 25, 1998 


DNA56870-1492 


209925 


June 2, 1998 


DNA57033-1403 


209905 


May 27. 1998 


DNA57037-1444 


209903 


May 27, 1998 


DNA57129-1413 


209977 


June 16, 1998 
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DNA57690-1374 


209950 


June 9, 1998 


DNA57693-1424 


203008 


June 23, 1998 


DNA57694-1341 


203017 


June 23, 1998 


DNA57695-1340 


203006 


June 23, 1998 


DNA57699-1412 


203020 


June 23, 1998 


DNA57702-1476 


209951 


June 9, 1998 


DNA57704-1452 


209953 


June 9, 1998 


DNA57708-1411 


203021 


June 23, 1998 


DNA57710-1451 


203048 


July 1, 1998 


DNA57711-1501 


203047 


July 1, 1998 


DNA57827-1493 


203045 


July 1, 1998 


DNA57834-1339 


209954 


June 9, 1998 


DNA57836-1338 


203025 


June 23, 1998 


DNA57838-1337 


203014 


June 23, 1998 


DNA57844-1410 


203010 


June 23, 1998 


DNA58721-1475 


203110 


August 11, 1998 


DNA58723-1588 


203133 


August 18, 1998 


DNA58737-1473 


203136 


August 18, 1998 


DNA58743-1609 


203154 


August 25, 1998 


DNA58846-1409 


209957 


June 9, 1998 


DNA58848-1472 


209955 


June 9, 1998 


DNA58849-1494 


209958 


June 9, 1998 


DNA58850-1495 


209956 


June 9, 1998 


DNA58853-1423 


203016 


June 23, 1998 


DNA58855-1422 


203018 


June 23, 1998 


DNA59205-1421 


203009 


June 23, 1998 


DNA592 11-1450 


209960 


June 9, 1998 


DNA5 9213- 1487 


209959 


June 9, 1998 


DNA59214-1449 


203046 


July 1, 1998 


DNA59215-1425 


209961 


June 9, 1998 


DNA59220-1514 


209962 


June 9, 1998 


DNA59488-1603 


203157 


August 25, 1998 


DNA59493-1420 


203050 


July I, 1998 


DNA59497-1496 


209941 


June 4, 1998 


DNA59588-1571 


203106 


August 11, 1998 


DNA59603-1419 


209944 


June 9, 1998 


DNA59605-1418 


203005 


June 23, 1998 


DNA59606-1471 


209945 


June 9, 1998 


DNA59607-1497 


209957 


June 9, 1998 


DNA59609-1470 


209963 


June 9, 1998 


DNA59610-1559 


209990 


June 16, 1998 


DNA59612-1466 


209947 


June 9, 1998 


DNA59613-1417 


203007 


June 23, 1998 


DNA59616-1465 


209991 


June 16, 1998 


DNA59619-1464 


203041 


July 1, 1998 


DNA59620-1463 


209989 


June 16, 1998 


DNA59625-1498 


209992 


June 17, 1998 


DNA59767-1489 


203108 


August 11, 1998 


DNA59776-1600 


203128 


August 18, 1998 


DNA59777-1480 


203111 


August 11, 1998 


DNA59820-1549 


203129 


August 18, 1998 


DNA59827-1426 


203089 


August 4, 1998 


DNA59828-1608 


203158 


August 25, 1998 


DNA59838-1462 


209976 


June 16, 1998 


DNA59839-1461 


209988 


June 16, 1998 


DNA59841-1460 


203044 


July 1, 1998 


DNA59842-1502 


209982 


June 16, 1998 
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DNA59846-1503 


209978 


June 16 t 1998 


DNA59847-1511 


203098 


August 4, 1998 


DNA59848-1512 


203088 


August 4, 1998 


DNA59849-1504 


209986 


June 16, 1998 


DNA59853-1505 


209985 


June 16, 1998 


DNA59854-1459 


209974 


June 16, 1998 


DNA60283-1484 


203043 


July 1, 1998 


DNA60615-1483 


209980 


June 16, 1998 


DNA60619-1482 


209993 


June 16, 1998 


DNA60621-1516 


203091 


August 4, 1998 


DNA60622-1525 


203090 


August 4, 1998 


DNA60625-1507 


209975 


June 16, 1998 


DNA60627-1508 


203092 


August 4, 1998 


DNA60629-1481 


209979 


June 16, 1998 


DNA61755-1554 


203112 


August 11, 1998 


DNA61873-1574 


203132 


August 18, 1998 


DNA62814-1521 


203093 


August 4, 1998 


DNA62872-1509 


203100 


August 4, 1998 


DNA62876-1517 


203095 


August 4, 1998 


DNA62881-1515 


203096 


August 4, 1998 


DNA64 852-1589 


203127 


August 18, 1998 


DNA64884-1527 


203155 


August 25, 1998 


DNA64890-1612 


203131 


August 18,' 1998 


DNA654I2-1523 


203094 


August 4, 1998 


DNA66308-1537 


203159 


August 25, 1998 


DNA66309-1538 


203235 


September 15, 1998 


DNA67004-16I4 


203115 


August 11, 1998 


DNA68869-1610 


203164 


August 25, 1998 


DNA68872-1620 


203160 


August 25, 1998 


DNA71159-16I7 


203135 


August 18, 1998 



These deposit were made under the provisions of the Budapest Treaty on the International Recognition 
of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder (Budapest 
Treaty). This assures maintenance of a viable culture of the deposit for 30 years from the date of deposit. The 
deposits will be made available by ATCC under the terms of the Budapest Treaty, and subject to an agreement 
between Genentech, Inc. and ATCC, which assures permanent and unrestricted availability of the progeny of 
the culture of the deposit to the public upon issuance of the pertinent U.S. patent or upon laying open to the 
public of any U.S. or foreign patent application, whichever comes first, and assures availability of the progeny 
to one determined by the U.S. Commissioner of Patents and Trademarks to be entitled thereto according to 35 
USC §122 and the Commissioner's rules pursuant thereto (including 37 CFR §1.14 with particular reference to 
886 OG 638). 

The assignee of the present application has agreed that if a culture of the materials on deposit should 
die or be lost or destroyed when cultivated under suitable conditions, the materials will be promptly replaced on 
notification with another of the same. Availability of the deposited material is not to be construed as a license 
to practice the invention in contravention of the rights granted under the authority of any government in 
accordance with its patent laws. 

The foregoing written specification is considered to be sufficient to enable one skilled in the an to 
practice the invention. The present invention is not to be limited in scope by the construct deposited, since the 
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deposited embodiment is intended as a single illustration of certain aspects of the invention and any constructs 
that are functionally equivalent are within the scope of this invention. The deposit of material herein does not 
constitute an admission that the written description herein contained is inadequate to enable the practice of any 
aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the 
claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition 
to those shown and described herein will become apparent to those skilled in the an from the foregoing 
description and fall within the scope of the appended claims. 
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WHAT IS CLAIMED IS : 

1. Isolated nucleic acid having at least 80% sequence identity to a nucleotide sequence that 
encodes a polypeptide comprising an amino acid sequence selected from the group consisting of the amino acid 
sequence shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID NO:6), Figure 6 (SEQ ID NO:8), Figure 9 
(SEQ ID NO: 14), Figure 12 (SEQ ID NO:20), Figure 15 (SEQ ID NO:23), Figure 18 (SEQ ID NO:28), Figure 
5 20 (SEQ ID NO:30), Figure 23 (SEQ ID NO:33), Figure 25 (SEQ ID NO:36), Figure 27 (SEQ ID NO:41), 
Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID NO:52), Figure 34 (SEQ ID NO:57), Figure 36 (SEQ ID 
NO:62), Figure 38 (SEQ ID NO:67), Figure 41 (SEQ ID NO:73), Figure 47 (SEQ ID NO: 84), Figure 49 (SEQ 
ID NO:95), Figure 51 (SEQ ID NO:97), Figure 53 (SEQ ID NO:99), Figure 57 (SEQ ID NO: 103), Figure 64 
(SEQ ID NO: 113), Figure 66 (SEQ ID NO: 115), Figure 68 (SEQ ID NO: 117), Figure 70 (SEQ ID NO: 1 19), 

10 Figure 72 (SEQ ID NO: 124), Figure 74 (SEQ ID NO: 129), Figure 76 (SEQ ID NO: 135), Figure 79 (SEQ ID 
NO:138), Figure 83 (SEQ ID NO:146), Figure 85 (SEQ rD NO:148). Figure 88 (SEQ ID NO:15i), Figure 90 
(SEQ ID NO: 153), Figure 93 (SEQ ID NO: 156), Figure 95 (SEQ ID NO: 158), Figure 97 (SEQ ID NO: 160), 
Figure 99 (SEQ ID NO:165), Figure 101 (SEQ ID NO:167), Figure 103 (SEQ ID NO:169), Figure 105 (SEQ 
ID NO:17I), Figure 109 (SEQ ID NO:175), Figure 111 (SEQ ID NO:177), Figure 113 (SEQ ID NO:179), 

15 Figure 1 15 (SEQ ID NO: 181), Figure 1 17 (SEQ ID NO: 183), Figure 120 (SEQ ID NO: 189), Figure 122 (SEQ 
ID NO: 194), Figure 125 (SEQ ID NO: 197), Figure 127 (SEQ ID NO: 199), Figure 129 (SEQ ID NO:201). 
Figure 131 (SEQ ID NO:203), Figure 133 (SEQ ID NO:205), Figure 135 (SEQ ID NO:207), Figure 137 (SEQ 
ID NO:209), Figure 139 (SEQ ID N0:21I), Figure 141 (SEQ ID NO:2I3), Figure 144 (SEQ ID NO:216), 
Figure i47 (SEQ ID NO:219), Figure 149 (SEQ ID N0.221), Figure 151 (SEQ ID N0:223), Figure 153 (SEQ 

20 ID N0:225), Figure 155 (SEQ ID NO:227), Figure 157 (SEQ ID N0:229), Figure 159 (SEQ ID NO:231), 
Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID NO:241), Figure 165 (SEQ ID NO:246), Figure 167 (SEQ 
ID NO:248), Figure 169 (SEQ ID NO:250), Figure 171 (SEQ ID NO:253), Figure 174 (SEQ ID NO:256), 
Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID NO:260), Figure 180 (SEQ ID NO:262), Figure 182 (SEQ 
ID NO:264), Figure 184 (SEQ ID NO:266), Figure 186 (SEQ ID NO:268), Figure 188 (SEQ ID NO:270), 

25 Figure 190 (SEQ ID NO:272), Figure 192 (SEQ ID NO:274), Figure 194 (SEQ ID NO:276), Figure 196 (SEQ 
ID NO:278), Figure 198 (SEQ ID NO:281), Figure 200 (SEQ ID NO:283), Figure 202 (SEQ ID NO:285), 
Figure 204 (SEQ ID NO:287), Figure 206 (SEQ ID NO:289), Figure 208 (SEQ ID NO:291), Figure 210 (SEQ 
ID NO:293), Figure 212 (SEQ ID NO:295), Figure 214 (SEQ ID NO:297), Figure 216 (SEQ ID NO:299), 
Figure 218 (SEQ ID NO:301), Figure 220 (SEQ ID NO:303), Figure 226 (SEQ ID NO:309), Figure 228 (SEQ 

30 ID NO:314), Figure 230 (SEQ ID NO:319), Figure 233 (SEQ ID NO:326), Figure 235 (SEQ ID NO:334), 
Figure 238 (SEQ ID NO:340), Figure 240 (SEQ ID NO:345), Figure 242 (SEQ ID NO:347), Figure 244 (SEQ 
ID NO:349), Figure 246 (SEQ ID NO:351), Figure 248 (SEQ ID NO:353), Figure 250 (SEQ ID NO:355), 
Figure 252 (SEQ ID NO:357), Figure 254 (SEQ ID NO:359), Figure 256 (SEQ ID NO:361), Figure 258 (SEQ 
ID NO:363), Figure 260 (SEQ ID NO:365), Figure 262 (SEQ ID NO:367), Figure 264 (SEQ ID NO:369), 

35 Figure 266 (SEQ ID NO:371), Figure 268 (SEQ ID NO:373), Figure 270 (SEQ ID NO:375), Figure 272 (SEQ 
ID NO:377), Figure 274 (SEQ ID NO:379), Figure 276 (SEQ ID NO:38l), Figure 278 (SEQ ID NO:387), 
Figure 280 (SEQ ID NO:389), Figure 282 (SEQ ID NO:394), Figure 284 (SEQ ID NO:399), Figure 286 (SEQ 
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ID NO:401), Figure 288 (SEQ ID NO:403), Figure 290 (SEQ ID NO:408), Figure 292 (SEQ ID KO:410), 
Figure 294 (SEQ ID NO:4l2), Figure 296 (SEQ ID NO:414), Figure 298 (SEQ ID NO:416), Figure 300 (SEQ 
ID NO:418), Figure 302 (SEQ ID NO:420), Figure 304 (SEQ ID NO:422) and Figure 306 (SEQ ID NO:424). 

2. The nucleic acid sequence of Claim 1 , wherein said nucleotide sequence comprises a nucleotide 
5 sequence selected from the group consisting of the sequence shown in Figure 1 (SEQ ID NO: 1), Figure 3 (SEQ 
ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO:13), Figure 11 (SEQ ID NO:19), Figure 14 (SEQ 
ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ ID NO:32), Figure 24 
(SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 (SEQ ID NO:51), Figure 
33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 40 (SEQ ID NO:72), 
10 Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), Figure 52 (SEQ ID 
NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO:112), Figure 65 (SEQ ID NO: 114), Figure 67 
(SEQ ID NO:116), Figure 69 (SEQ ID N0:118), Figure 71 (SEQ ID NO:123). Figure 73 (SEQ ID NO:128), 
Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO: 145), Figure 84 (SEQ ID 
NO:147), Figure 87 (SEQ ID NO:150), Figure 89 (SEQ ID NO:152), Figure 92 (SEQ ID NO:155), Figure 94 

15 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164). Figure 100 (SEQ ID NO: 166), 
Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID NO: 174), Figure 1 10 (SEQ 
ID NO:176), Figure 112 (SEQ ID NO:178), Figure 114 (SEQ ID NO:180), Figure 116 (SEQ ID NO:182), 
Figure 119 (SEQ ID NO:188), Figure 121 (SEQ ID NO:193), Figure 124 (SEQ ID NO:196), Figure 126 (SEQ 
ID NO:198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 132 (SEQ ID NO:204), 

20 Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID NO:210), Figure 140 (SEQ 
ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 148 (SEQ ID NO:220), 
Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID NO:226), Figure 156 (SEQ 
ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 162 (SEQ ID NO:240), 
Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID NO:249). Figure 170 (SEQ 

25 ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure 177 (SEQ ID NO:259), 
Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID NO:265), Figure 185 (SEQ 
ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 191 (SEQ ID NO:273), 
Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID NO:280), Figure 199 (SEQ 
ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 205 (SEQ ID NO:288), 

30 Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID NO:294), Figure 213 (SEQ 
ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 219 (SEQ ID NO:302), 
Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID NO:318), Figure 232 (SEQ 
ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 239 (SEQ ID NO:344), 
Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID NO:350), Figure 247 (SEQ 

35 ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 253 (SEQ ID NO:358), 
Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID NO:364), Figure 261 (SEQ 
ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 267 (SEQ ID NO:372), 
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Figure 269 (SEQ ID NO:374). Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID NO:378), Figure 275 (SEQ 
ID NO:380). Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 281 (SEQ ID NO:393), 
Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID NO:402), Figure 289 (SEQ 
ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:411), Figure 295 (SEQ ID NO:413) ( 
Figure 297 (SEQ ID NO:4l5), Figure 299 (SEQ ID NO:417) T Figure 301 (SEQ ID NO:419), Figure 303 (SEQ 
5 ID NO:421) and Figure 305 (SEQ ID NO:423). 

3. The nucleic acid of Claim 1 , wherein said nucleotide sequence comprises a nucleotide sequence 
selected from the group consisting of the full-length coding sequence of the sequence shown in Figure 1 (SEQ 
ID NO:l). Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO:13), Figure 11 (SEQ ID 

10 NO: 19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ 
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 
(SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61). Figure 37 (SEQ ID NO:66), Figure 
40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), 
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 112), Figure 65 (SEQ ID 

15 NO: 114), Figure 67 (SEQ ID NO: 116), Figure 69 (SEQ ID NO: 1 18), Figure 71 (SEQ ID NO:123), Figure 73 
(SEQ ID NO:128), Figure 75 (SEQ ID NO:134), Figure 78 (SEQ ID NO:137), Figure 82 (SEQ ID NO:145), 
Figure 84 (SEQ ID NO:147), Figure 87 (SEQ ID NO:150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID 
NO:155), Figure 94 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO:164), Figure 100 
(SEQ ID NO:166), Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO:170), Figure 108 (SEQ ID 

20 NO: 174), Figure 110 (SEQ ID NO: 176), Figure 1 12 (SEQ ID NO: 178), Figure 114 (SEQ ID NO: 180), Figure 
116 (SEQ ID NO:182), Figure 119 (SEQ ID NO:188), Figure 121 (SEQ ID NO:193), Figure 124 (SEQ ID 
NO: 196), Figure 126 (SEQ ID NO: 198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 
132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID 
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 

25 148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID 
NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO;245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID 
NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure 
177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 

30 NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 
191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID 
NO:280), Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 
205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID 
NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 

35 219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:3I3), Figure 229 (SEQ ID 
NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333) ( Figure 237 (SEQ ID NO:339), Figure 
239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID 
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NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID 
NO:364), Figure 261 (SEQ ID NO:366). Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO;370), Figure 
267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID 
NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 
5 281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID 
NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:4I I), Figure 
295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID 
N0 419), Figure 303 (SEQ ID NO:421) or Figure 305 (SEQ ID NO:423). 

10 4. Isolated nucleic acid which comprises the full-length coding sequence of the DNA deposited 

under any ATCC accession number shown in Table 2. 

5. A vector comprising the nucleic acid of Claim 1. 

15 6. The vector of Claim 5 operabiy linked to control sequences recognized by a host cell 

transformed with the vector. 

7. A host cell comprising the vector of Claim 5. 

20 8. The host cell of Claim 7 wherein said cell is a CHO cell. 

9. The host cell of Claim 7 wherein said cell is an £. colt. 

10. The host cell of Claim 7 wherein said cell is a yeast cell. 

25 

11. A process for producing a PRO polypeptides comprising culturing the host cell of Claim 7 
under conditions suitable for expression of said PRO polypeptide -and recovering said PRO polypeptide from the 
cell culture. 

30 12. Isolated PRO polypeptide having at least 80% sequence identity to an amino acid sequence 

selected from the group consisting of the amino acid sequence shown in Figure 2 (SEQ ID NO:2) t Figure 4 (SEQ 
ID NO:6), Figure 6 (SEQ ID NO:8) t Figure 9 (SEQ ID NO: 14), Figure 12 (SEQ ID NO:20), Figure 15 (SEQ 
ID NO:23), Figure 18 (SEQ ID NO:28), Figure 20 (SEQ ID NO:30), Figure 23 (SEQ ID NO:33), Figure 25 
(SEQ ID NO:36), Figure 27 (SEQ ID NO:41), Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID NO:52), Figure 

3 5 34 (SEQ ID NO:57), Figure 36 (SEQ ID NO:62), Figure 38 (SEQ ID NO:67), Figure 41 (SEQ ID NO:73), 
Figure 47 (SEQ ID NO:84), Figure 49 (SEQ ID NO:95), Figure 51 (SEQ ID NO:97), Figure 53 (SEQ ID 
NO:99), Figure 57 (SEQ ID NO: 103), Figure 64 (SEQ ID NO: 113), Figure 66 (SEQ ID NO:I15), Figure 68 
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(SEQ ID NO: 1 17), Figure 70 (SEQ ID NO: 119), Figure 72 (SEQ ID NO: 124), Figure 74 (SEQ ID NO: 129), 
Figure 76 (SEQ ID NO: 135), Figure 79 (SEQ ID NO: 138), Figure 83 (SEQ ID NO: 146), Figure 85 (SEQ ID 
NO: 148), Figure 88 (SEQ ID NO: 151), Figure 90 (SEQ ID NO: 153), Figure 93 (SEQ ID NO: 156), Figure 95 
(SEQ ID NO:I58), Figure 97 (SEQ ID NO: 160), Figure 99 (SEQ ID NO:165), Figure 101 (SEQ ID NO:167), 
Figure 103 (SEQ ID NO: 169), Figure 105 (SEQ ID NO: 171), Figure 109 (SEQ ID NO: 175), Figure 111 (SEQ 
5 ID NO: 177), Figure 113 (SEQ ID NO: 179), Figure 115 (SEQ ID NO: 181), Figure 117 (SEQ ID NO: 183), 
Figure 120 (SEQ ID NO: 189). Figure 122 (SEQ ID NO:194), Figure 125 (SEQ ID NO:197), Figure 127 (SEQ 
ID NO: 199). Figure 129 (SEQ ID NO:201) f Figure 131 (SEQ ID NO:203), Figure 133 (SEQ ID NO:205), 
Figure 135 (SEQ IDNO:207), Figure 137 (SEQ ID NO:209), Figure 139 (SEQ IDNO:211), Figure 141 (SEQ 
ID NO:213), Figure 144 (SEQ ID NO:216), Figure 147 (SEQ ID NO:219), Figure 149 (SEQ ID NO:221), 

10 Figure 151 (SEQ ID NO:223), Figure 153 (SEQ ID NO:225), Figure 155 (SEQ ID NO:227), Figure 157 (SEQ 
ID NO:229), Figure 159 (SEQ ID NO:231), Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID NO:241), 
Figure 165 (SEQ ID NO:246), Figure 167 (SEQ ID NO:248), Figure 169 (SEQ ID NO:250), Figure 171 (SEQ 
ID NO:253), Figure 174 (SEQ ID NO:256), Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID NO:260), 
Figure 180 (SEQ ID NO:262), Figure 182 (SEQ ID NO:264), Figure 184 (SEQ ID NO:266), Figure 186 (SEQ 

15 ID NO:268), Figure 188 (SEQ ID NO:270), Figure 190 (SEQ ID NO:272), Figure 192 (SEQ ID NO:274), 
Figure 194 (SEQ ID NO:276), Figure 196 (SEQ ID NO:278), Figure 198 (SEQ ID NO:281), Figure 200 (SEQ 
ID NO:283), Figure 202 (SEQ ID NO:285), Figure 204 (SEQ ID NO:287), Figure 206 (SEQ ID NO:289), 
Figure 208 (SEQ ID NO:291), Figure 210 (SEQ ID NO:293), Figure 212 (SEQ ID NO:295), Figure 214 (SEQ 
ID NO:297), Figure 216 (SEQ ID NO;299), Figure 218 (SEQ ID NO:30I), Figure 220 (SEQ ID NO:303), 

20 Figure 226 (SEQ ID NO:309). Figure 228 (SEQ ID NO:314), Figure 230 (SEQ ID NO:319), Figure 233 (SEQ 
ID NO:326), Figure 235 (SEQ ID NO:334), Figure 238 (SEQ ID NO:340), Figure 240 (SEQ ID NO:345), 
Figure 242 (SEQ ID NO:347), Figure 244 (SEQ ID NO:349), Figure 246 (SEQ ID NO:351), Figure 248 (SEQ 
ID NO:353), Figure 250 (SEQ ID NO:355), Figure 252 (SEQ ID NO:357), Figure 254 (SEQ ID NO:359), 
Figure 256 (SEQ ID NO:361), Figure 258 (SEQ ID NO:363), Figure 260 (SEQ ID NO:365), Figure 262 (SEQ 

25 ID NO:367), Figure 264 (SEQ ID NO:369), Figure 266 (SEQ ID NO:371), Figure 268 (SEQ ID NO:373), 
Figure 270 (SEQ ID NO:375), Figure 272 (SEQ ID NO:377), Figure 274 (SEQ ID NO:379), Figure 276 (SEQ 
ID NO:381), Figure 278 (SEQ ID NO:387), Figure 280 (SEQ ID NO:389), Figure 282 (SEQ ID NO:394), 
Figure 284 (SEQ ID NO:399), Figure 286 (SEQ ID NO:401), Figure 288 (SEQ ID NO:403), Figure 290 (SEQ 
ID NO:408), Figure 292 (SEQ ID NO:410), Figure 294 (SEQ ID NO:412), Figure 296 (SEQ ID NO:414), 

30 Figure 298 (SEQ ID NO:416), Figure 300 (SEQ ID N0.418), Figure 302 (SEQ ID NO:420), Figure 304 (SEQ 
ID NO:422) and Figure 306 (SEQ ID NO:424). 

13. Isolated PRO polypeptide having at least 80% sequence identity to the amino acid sequence 
encoded by a nucleic acid molecule deposited under any ATCC accession number shown in Table 2. 

35 

14. A chimeric molecule comprising a polypeptide according to Claim 12 fused to a heterologous 
amino acid sequence. 
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15. The chimeric molecule of Claim 14 wherein said heterologous amino acid sequence is an 
epitope lag sequence. 

16. The chimeric molecule of Claim 14 wherein said heterologous amino acid sequence is a Fc 
region of an immunoglobulin. 

5 

17. An antibody which specifically binds 10 a PRO polypeptide according to Claim 12. 

18. The antibody of Claim 17 wherein said antibody is a monoclonal antibody. 
10 19. The antibody of Claim 17 wherein said antibody is a humanized antibody. 

20. The antibody of Claim 17 wherein said antibody is an antibody fragment. 

21 . An isolated nucleic acid molecule which has at least 80% sequence identity to a nucleic acid 
15 which comprises a nucleotide sequence selected from the group consisting of that shown in Figure 1 (SEQ ID 

NO:l), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13), Figure 11 (SEQ ID 
NO: 19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ 
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 
vSfcQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 

20 40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), 
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 112), Figure 65 (SEQ ID 
NO: 114), Figure 67 (SEQ ID NO: 116), Figure 69 (SEQ ID NO: 1 18), Figure 71 (SEQ ID NO: 123), Figure 73 
(SEQ ID NO: 128), Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO: 145), 
Figure 84 (SEQ ID NO:147), Figure 87 (SEQ ID NO:150), Figure 89 (SEQ ID NO:152), Figure 92 (SEQ ID 

25 NO: 155), Figure 94 (SEQ ID NO:I57), Figure 96 (SEQ ID NO:159), Figure 98 (SEQ ID N0:164), Figure 100 
(SEQ ID NO:166), Figure 102 (SEQ ID NO:168), Figure 104 (SEQ ID NO:I70), Figure 108 (SEQ ID 
NO: 174), Figure 1 10 (SEQ ID NO: 176), Figure 112 (SEQ ID NO: 178), Figure 114 (SEQ ID NO: 180), Figure 
116 (SEQ ID NO: 182), Figure 119 (SEQ ID NO: 188), Figure 121 (SEQ ID NO: 193), Figure 124 (SEQ ID 
NO: 196), Figure 126 (SEQ ID NO: 198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 

30 132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208). Figure 138 (SEQ ID 
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 
148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID 
NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID 

35 NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure 
177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 
NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 
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191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID 
NO: 280). Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286). Figure 
205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID 
NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 
219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID 
5 NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 
239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID 
NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360) t Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID 
NO:364), Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 

10 267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID 
NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 
281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID 
NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:41 1), Figure 
295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID 

15 NO:419), Figure 303 (SEQ ID NO:421) and Figure 305 (SEQ ID NO:423). 

22. An isolated nucleic acid molecule which has at least 80% sequence identity to the full-length 
coding sequence of a nucleotide sequence selected from the group consisting of that shown in Figure 1 (SEQ ID 
NO:l), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13), Figure 11 (SEQ ID 

20 NO: 19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ 
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 
(SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61) ) Figure 37 (SEQ ID NO:66), Figure 
40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), 
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 112), Figure 65 (SEQ ID 

25 NO: 1 14), Figure 67 (SEQ ID NO:l 16), Figure 69 (SEQ ID NO:l 18), Figure 71 (SEQ ID NO:123), Figure 73 
(SEQ ID N0.128), Figure 75 (SEQ ID NO:134), Figure 78 (SEQ ID NO:137), Figure 82 (SEQ ID N0.145), 
Figure 84 (SEQ ID NO: 147), Figure 87 (SEQ ID NO: 150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID 
NO: 155), Figure 94 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164), Figure 100 
(SEQ ID NO: 166), Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID 

30 NO: 174), Figure 1 10 (SEQ ID NO: 176), Figure 1 12 (SEQ ID NO: 178), Figure 1 14 (SEQ ID NO: 180), Figure 
116 (SEQ ID NO: 182), Figure 119 (SEQ ID NO:188), Figure 121 (SEQ ID NO:193), Figure 124 (SEQ ID 
NO: 196), Figure 126 (SEQ ID NO:198), Figure 128 (SEQ ID NO;200), Figure 130 (SEQ ID NO:202), Figure 
132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID 
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 

35 148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222). Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID 
NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID 
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NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257). Figure 
177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 
NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269) f Figure 189 (SEQ ID NO:271), Figure 
191 (SEQ ID NO:273) t Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID 
NO:280), Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 
5 205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO;292) t Figure 211 (SEQ ID 
NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 
219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313). Figure 229 (SEQ ID 
NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 
239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID 

10 NO:350) f Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID 
NO:364), Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 
267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID 
NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386). Figure 279 (SEQ ID NO:388), Figure 

15 281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID 
NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:41 1), Figure 
295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID 
NO:419), Figure 303 (SEQ ID NO:42I) and Figure 305 (SEQ ID NO:423). 

20 23. An isolated extracellular domain of of PRO polypeptide. 

24. An isolated PRO polypeptide lacking its associated signal peptide. 

25. An isolated polypeptide having at least about 80% amino acid sequence identity to an 
25 extracellular domain of of PRO polypeptide. 

26. An isolated polypeptide having at least about 80% amino acid sequence identity to a PRO 
polypeptide lacking its associated signal peptide. 
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FIGURE 1 

CGGACGCGTGGGTGCGAGGCGAAGGTGACCGGGGACCGAGCATTTCAGATCTGCTCGGTAGA 

CCTGGTGCACCACCACCATGTTGGCTGCAAGGCTGGTGTGTCTCCGGACACTACCTTCTAGG 

GTTTTCCACCCAGCTTTCACCAAGGCCTCCCCTGTTGTGAAGAATTCCATCACGAAGAATCA 

ATGGCTGTTAACACCTAGCAGGGAATATGCCACCAAAACAAGAATTGGGATCCGGCGTGGGA 

GAACTGGCCAAGAACTCAAAGAGGCAGCATTGGAACCATCGATGGAAAAAATATTTAAAATT 

GATCAGATGGGAAGATGGTTTGTTGCTGGAGGGGCTGCTGTTGGTCTTGGAGCATTGTGCTA 

CTATGGCTTGGGACTGTCTAATGAGATTGGAGCTATTGAAAAGGCTGTAATTTGGCCTCAGT 

ATGTCAAGGATAGAATTCATTCCACCTATATGTACTTAGCAGGGAGTATTGGTTTAACAGCT 

TTGTCTGCCATAGCAATCAGCAGAACGCCTGTTCTCATGAACTTCATGATGAGAGGCTCTTG 

GGTGACAATTGGTGTGACCTTTGCAGCCATGGTTGGAGCTGGAATGCTGGTACGATCAATAC 

CATATGACCAGAGCCCAGGCCCAAAGCATCTTGCTTGGTTGCTACATTCTGGTGTGATGGGT 

GCAGTGGTGGCTCCTCTGACAATATTAGGGGGTCCTCTTCTCATCAGAGCTGCATGGTACAC 

AGCTGGCATTGTGGGAGGCCTCTCCACTGTGGCCATGTGTGCGCCCAGTGAAAAGTTTCTGA 

ACATGGGTGCACCCCTGGGAGTGGGCCTGGGTCTCGTCTTTGTGTCCTCATTGGGATCTATG 

TTTCTTCCACCTACCACCGTGGCTGGTGCCACTCTTTACTCAGTGGCAATGTACGGTGGATT 

AGTTCTTTTCAGCATGTTCCTTCTGTATGATACCCAGAAAGTAATCAAGCGTGCAGAAGTAT 

CACCAATGTATGGAGTTCAAAAATATGATCCCATTAACTCGATGCTGAGTATCTACATGGAT 

ACATTAAATATATTTATGCGAGTTGCAACTATGCTGGCAACTGGAGGCAACAGAAAGAAATG 

AAGTGACTCAGCTTCTGGCTTCTCTGCTACATCAAATATCTTGTTTAATGGGGCAGATATGC 

ATTAAATAGTTTGTACAAGCAGCTTTCGTTGAAGTTTAGAAGATAAGAAACATGTCATCATA 

TTTAAATGTTCCGGTAATGTGATGCCTCAGGTCTGCCTTTTTTTCTGGAGAATAAATGCAGT 

AATCCTCTCCCTU^ATAAGCACACACATTTTCAATTCTCATGTTTGAGTGATTTTAAAATGTT 

TTGGTGAATGTGAAAACTAAAGTTTGTGTCATGAGAATGTAAGTCTTTTTTCTACTTTAAAA 

TTTAGTAGGTTCACTGAGTAACTAAAATTTAGCAAACCTGTGTTTGCATATTTTTTTGGAGT 

GCAGAATATTGTAATTAATGTCATAAGTGATTTGGAGCTTTGGTAAAGGGACCAGAGAGAAG 

GAGTCACCTGCAGTCTTTTGTTTTTTTAAATACTTAGAACTTAGCACTTGTGTTATTGATTA 

GTGAGGAGCCAGTAAGAAACATCTGGGTATTTGGAAACAAGTGGTCATTGTTACATTCATTT 

GCTGAACTTAACAAAACTGTTCATCCTGAAACAGGCACAGGTGATGCATTCTCCTGCTGTTG 

CTTCTCAGTGCTCTCTTTCCAATATAGATGTGGTCATGTTTGACTTGTACAGAATGTTAATC 

ATACAGAGAATCCTTGATGGAATTATATATGTGTGTTTTACTTTTGAATGTTACAAAAGGAA 

AT AACTTT AAAACT AT T CT C AAGAG AAAAT AT TC AAAG CATG AAAT AT GT TG CTTTTT C CAG 

AATACAAACAGTATACTCATG 
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FIGURE 2 

MLAARLVCLRTLPSRVFHPAFTKASPVVKNS I TKNQWLLTPSREYATKTRIG IRRGRTGQEL 
KEAALEPSMEKIFKIDQMGRWFVAGGAAVGLGALCYYGLGLSNEIGAIEKAVIWPQYVKDRI 
HSTYMYLAGS IGLTALSAI AI SRTPVLMNFMMRGSWVT I GVTFAAMVGAGMLVRS I PYDQS P 
GPKHIAVTLLHSGVMGAWAPLTILGGPLLIR 

GVGLGLVF VS SLGSMFLP PTTVAGATLYSVAMYGGLVLFSMFLLYDTQKVI KRAEVS PMYGV 
QKYDPINSMLSIYMDTLNIFMRVATMLATGGNRKK 
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FIGURE 3 

GAAGGCTGCCTCGCTGGTCCGAATTCGGTGGCGCCACGTCCGCCCGTCTCCGCCTTCTGCAT 

CGCGGCTTCGGCGGCTTCCACCTAGACACCTAACAGTCGCGGAGCCGGCCGCGTCGTGAGGG 

GGTCGGCACGGGGAGTCGGGCGGTCTTGTGCATCTTGGCTACCTGTGGGTCGAAGATGTCGG 

ACATCGGAGACTGGTTCAGGAGCATCCCGGCGATCACGCGCTATTGGTTCGCCGCCACCGTC 

GCCGTGCCCTTGGTCGGCAAACTCGGCCTCATCAGCCCGGCCTACCTCTTCCTCTGGCCCGA 

AGCCTTCCTTTATCGCTTTCAGATTTGGAGGCCAATCACTGCCACCTTTTATTTCCCTGTGG 

GTCCAGGAACTGGATTTCTTTATTTGGTCAATTTATATTTCTTATATCAGTATTCTACGCGA 

CTTGAAACAGGAGCTTTTGATGGGAGGCCAGCAGACTATTTATTCATGCTCCTCTTTAACTG 

GATTTGCATCGTGATTACTGGCTTAGCAATGGATATGCAGTTGCTGATGATTCCTCTGATCA 

TGTCAGTACTTTATGTCTGGGCCCAGCTGAACAGAGACATGATTGTATCATTTTGGTTTGGA 

ACACGATTTAAGGCCTGCTATTTACCCTGGGTTATCCTTGGATTCAACTATATCATCGGAGG 

CTCGGTAATCAATGAGCTTATTGGAAATCTGGTTGGACATCTTTATTTTTTCCTAATGTTCA 

GATACCCAATGGACTTGGGAGGAAGAAATTTTCTATCCACACCTCAGTTTTTGTACCGCTGG 

CTGCCCAGTAGGAGAGGAGGAGTATCAGGATTTGGTGTGCCCCCTGCTAGCATGAGGCGAGC 

TGCTGATCAGAATGGCGGAGGCGGGAGACACAACTGGGGCCAGGGCTTTCGACTTGGAGACC 

AGTGAAGGGGCGGCCTCGGGCAGCCGCTCCTCTCAAGCCACATTTCCTCCCAGTGCTGGGTG 

CACTTAACAACTGCGTTCTGGCTAACACTGTTGGACCTGACCCACACTGAATGTAGTCTTTC 

AGTACGAGACAAAGTTTCTTAAATCCCGAAGAAAAATATAAGTGTTCCACAAGTTTCACGAT 

TCTCATTCAAGTCCTTACTGCTGTGAAGAACT^TACCAACTGTGCAAATTGCAAAACTGAC 

TACATTTTTTGGTGTCTTCTCTTCTCCCCTTTCCGTCTGAATAATGGGTTTTAGCGGGTCCT 

AATCTGCTGGt^TTGAGCTGGGGCTGGGTCACCAAACCCTTCCCAAAAGGACCTTATCTCTT 

TCTTGCACACATGCCTCTCTCCCACTTTTCCCAACCCCCACATTTGCAACTAGAAAAAGTTG 

CCCATAAAATTGCTCTGCCCTTGACAGGTTCTGTTATTTATTGACTTTTGCCAAGGCTGGTC 

ACAACAATCATATTCACGTTATTTTCCCCTTTTGGTGGCAGAACTGTTACCAATAGGGGGAG 

AAGACAGCCACGGATG AAG CGTTT CTCAG CTTTTGGAATTGCTTCGACTGACATCCGTTGTT 

AACCGTTTGCCACTCTTCAGATATTTTTTATAAAAAAAGTACCACTGAGTTCATGAGGGCCA 

CAGATTGGTTATTAATGAGATACGAGGGTTGGTGCTGGGTGTTTGTTTCCTGAGCTAAGTGA 

TCAAGACTGTAGTGGAGTTGCAGCTAACATGGGTTAGGTTTAAACCATGGGGGATGCACCCC 

TTTGCGTTTCATATGTAGCCCTACTGGCTTTGTGTAGCTGGAGTAGTTGGGTTGCTTTGTGT 

TAGGAGGATCCAGATCATGTTGGCTACAGGGAGATGCTCTCTTTGAGAGGTCCTGGGCATTG 

ATTCCCATTTCAATCTCATTCTGGATATGTGTTCATTGAGTAAAGGAGGAGAGACCCTCATA 

CGCTATTTAAATGTCACTTTTTTGCCTATCCCCCGTTTTTTGGTCATGTTTCAATTAATTGT 

G AGG AAG G CG CAGCTCCT CT CTGC ACG T AGAT CATT TTTTAAAG CTAATGTAAG CACATCT A 

AGGGAATAACATGATTTAAGGTTGAAATGGCTTTAGAATCATTTGGGTTTGAGGGTGTGTTA 

TTTTGAGTCATGAATGTACAAGCTCTGTGAATCAGACCAGCTTAAATACCCACACCTTTTTT 

TCGT AGGTGGGCTTTTCCTATCAGAG CTTGGCTCATAAC CAAATAAAGTTTTTTGAAGG CCA 

TGGCTTTTCACACAGTTATTTTATTTTATGACGTTATCTGAAAGCAGACTGTTAGGAGCAGT 

ATTGAGTGGCTGTCACACTTTGAGGCAACTAAAAAGGCTTCAAACGTTTTGATCAGTTTCTT 

TTCAGGAAACATTGTGCTCTAACAGTATGACTATTCTTTCCCCCACTCTTAAACAGTGTGAT 

GTGTGTTATCCTAGGAAATGAGAGTTGGCAAACAACTTCTCATTTTGAATAGAGTTTGTGTG 

TACTTCTCCATATTTAATTTATATGATAAAATAGGTGGGGAGAGTCTGAACCTTAACTGTCA 

TGTTTTGTTGTTCATCTGTGGCCACAATAAAGTTTACTTGTAAT^ATTTTAGAGGCCATTACT 

CCAATTATGTTGCACGTACACTCATTGTACAGGCGTGGAGACTCATTGTATGTATAAGAATA 

TTTCTGACAGTGAGTGACCCGGAGTCTCTGGTGTACCCTCTTACCAGTCAGCTGCCTGCGAG 

CAGTCATTTTTTCCTAAAGGTTTACAAGTATTTAGAACTTTTCAGTTCAGGGCAAAATGTTC 

ATGAAGTTATTCCTCTTAAACATGGTTAGGAAGCTGATGACGTTATTGATTTTGTCTGGATT 

ATGTTTCTGGAATAATTTTACCAAAACAAGCTATTTGAGTTTTGACTTGACAAGGCAAAACA 

TGACAGTGGATTCTCTTTACAAATGGAAAAAAAAAATCCTTATTTTGTATAAAGGACTTCCC 

TTTTTGTAAACTAATCCTTTTTATTGGTAAAAATTGTAAATTAAAATGTGCAACTTG 
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FIGURE 4 

MSDI GDWFRS I PAI TRYWFAATVAVPLVGKLGL I S P AYLFLWPE AFLYRFQ I WR P I TATF YF 
PVGPGTGFLYLVNLYFLYQYSTRLETGAFDGR PAD YLFMLLFNWI C I VI TGLAMDMQLLM I P 
LIMSVLYVWAQLNRDMIVSFWFGTRFKACYLPWVILGFNYIIGGSVINELIGNLVGHLYFFL 
MFRYPMDLGGRNFLSTPQFLYRWLPSRRGGVSGFGVPPASMRRAADQNGG<^ 
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FIGURE 5 

GGGGCCGCGGTCTAGGGCGGC7ACGTGTGTTGCCATAGCGACCATTTTGCATTAACTGGTTG 

GTAGCTTCTATCCTGGGGGCTG AG CGACTGCG GG CCAG CT CTT CCCCTAC TCCCTCTCGG CT 

CCTTGTGGCCCAAAGGCCTAACCGGGGTCCGGCGGTCTGGCCTAGGGATCTTCCCCGTTGCC 

CCTTTGGGGCGGGATGGCTGCGGAAGAAGAAGACGAGGTGGAGTGGGTAGTGGAGAGCATCG 

CGGGGTTCCTGCGAGGCCCAGACTGGTCCATCCCCATCTTGGACTTTGTGGAACAGAAATGT 

GAAGTTAACTGCAAAGGAGGGCATGTGATAACTCCAGGAAGCCCAGAGCCGGTGATTTTGGT 

GGCCTGTGTTCCCCTTGTTTTTGATGATGAAGAAGAAAGCAAATTGACCTATACAGAGATTC 

ATCAGGAATACAAAGAACTAGTTGAAAAGCTGTTAGAAGGTTACCTCAAAGAAATTGGAATT 

AATGAAGATCAATTTCAAGAAGCATGCACTTCTCCTCTTGCAAAGACCCATACATCACAGGC 

CATTTTGCAACCTGTGTTGGCAGCAGAAGATTTTACTATCTTTAAAGCAATGATGGTCCAGA 

AAAACATTGAAATGCAGCTGCAAGCCATTCGAATAATTCAAGAGAGAAATGGTGTATTACCT 

GACTGCTTAACCGATGGCTCTGATGTGGTCAGTGACCTTGAACACGAAGAGATGAAAATCCT 

GAGG G AAG TT CTTAGAAAATCAAAAGAGGAATATGA C CAG GAAGAA GAAAGGAAGAGG AAAA 

AACAGTTATCAGAGGCTAAAACAGAAGAGCCCACAGTGCATTCCAGTGAAGCTGCAATAATG 

AATAATTCCCAAGGGGATGGTGAACATTTTGCACACCCACCCTCAGAAGTTAAAATGCATTT 

TGCTAATCAGTCAATAGAACCTTTGGGAAGAAAAGTGGAAAGGTCTGAAACTTCCTCCCTCC 

CACAAAAAGGCCTGAAGATTCCTGGCTTAGAGCATGCGAGCATTGAAGGACCAATAGCAAAC 

TTATCAGTACTTGGAACAGAAGAACTTCGGCAACGAGAACACTATCTCAAGCAGAAGAGAGA 

T AAG TTGATG T C C AT G AG AAAG G AT ATG AG GACTAAACAG AT AC AAAAT AT GGAGC AG AAAG 

GAAAACCCACTGGGGAGGTAGAGGAAATGACAGAGAAACCAGAAATGACAGCAGAGGAGAAG 

CAAACATTACTAAAGAGGAGATTGCTTGCAGAGAAACTCAAAGAAGAAGTTATTAATAAGT^ 

ATAATTAAGAACAATTTAACAAAATGGAAGTTCAAATTGTCTTAAAAATAAATTATTTAGTC 
CTTACACTG 
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FIGURE 6 

MAAEEEDEVEWWES I AGFLRGPDWS I P I LDFVEQKCEVNCKGGHVI TPGS PEPVI LVACVP 
LVFDDEEESKLTYTEIHQEYKELVEKLLEGYLKEIGINEDQFQEACTSPLAKTHTSQAILQP 
VLAAEDFTIFKAMMVQKNIEMQLQAIRIIQERNGVLPDCLTDGSDWSDLEHEEMKILREVL 
RKSKEEYDQEEERKRKKQLSEAKTEEPTVHSSEAAIMNNSQGDGEHFAHPPSEVKMHFANQS 
IEPLGRKVERSETSSLPQKGLKIPGLEHASIEGPIANLSVLGTEELRQREHYLKQKRDKLMS 
MRKDMRTKQIQNMEQKGKPTGEVEEMTEKPEMTAEEKQTLLKRRLLAEKLKEEVINK 
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FIGURE 7 

GGGCACAGCACATGTGAAGTTTTTGATGATGAAGAAGAAAGCAAATTGACCTATACAGAGAT 
TCATCAGGAATACAAAGAACTAGTTGAAAAGCTGTTAGAAGGTTACCTCAAAGAAATTGGAA 
TTAATGAAGATCAATTT C AAGAAG CATG CACTTCTCCT CTTG CAAAGA CC CATACATCACAG 
GCCATTTTTGCAACCTGTGTTGGCAGCAGAAGATTTTACTATCTTTAAAGCAATGATGGTCC 
AG AAAAAC AT TG AAATG C AG CTG C AAG C CAT T CGAATAAT TC AAGAG AG AAATGG TGT ATT A 
CCTGACTGCTTAACCGATGGCTCTGATGTGGTCAGTGACCTTGAACACGAAGAGATGAAAAT 
CCTGAGGGAAGTTCTTAGAAAATCAAAAGAGGAATATGACCAGGAA 
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FTGURE8 

GCGTGGTTTTTGTTCTGCAATAGGCGGCTTAGAGGGAGGGGCTTTTTCGCCTATACCTACTG 
TAGCTTCTCCACGTATGGACCCTAAAGGCTACTGCTGCTACTACGGGGCTAGACAGTTACTG 
TCTCAGCTCTAGGATGTGCGTTCTTCCACTAGAAGCTCTTCTGAGGGAGGTAATTAAAAAAC 
AGTGGAATGGAAAAACAGTGCTGTAGTCATCCTGTAATATGCTCCTTGTCAACAATGTATAC 
ATTCCTGCTAGGTGCCATATTCATTGCTTTAAGCTCAAGTCGCATCTTACTAGTGAAGTATT 
CTGCCAATGAAGAAAACAAGTATGATTATCTTCCAACTACTGTGAATGTGTGCTCAGAACTG 
GTGAAGCTAGTTTTCTGTGTGCTTGTGTCATTCTGTGTTATAAAGAAAGATCATCAAAGTAG 
AAATTTGAAATATGCTTCCTGGAAGGAATTCTCTGATTTCATGAAGTGGTCCATTCCTGCCT 
TTCTTTATTTCCTGGATAACTTGATTGTCTTCTATGTCCTGTCCTATCTTCAACCAGCCATG 
GCTGTTATCTTCTCAAATTTTAGCATTATAACAACAGCTCTTCTATTCAGGATAGTGCTGAA 
GAGGCGTCTAAACTGGATCCAGTGGGCTtCCCTCCTGACTTTATTTTTGTCTATTGTGGCCT 
TGACTGCCGGGACTAAAACTTTACAGCACAACTTGGCAGGACGTGGATTTCATCACGATGCC 
TTTTTCAGCCCTTCCAATTCCTGCCTTCTTTTCAGAAGTGAGTGTCCCAGAAAAGACAATTG 
TACAGCAAAGGAATGGACTTTTCCTGAAGCTAAATGGAACACCACAGCCAGAGTTTTCAGTC 
ACATCCGTCTTGGCATGGGCCATGTTCTTATTATAGTCCAGTGTTTTATTTCTTCAATGGCT 
AATATCTATAATGAAAAGATACTGAAGGAGGGGAACCAGCTCACTGAAAGCATCTTCATACA 
GAACAGCAAACTCTATTTCTTTGGCATTCTGTTTAATGGGCTGACTCTGGGCCTTCAGAGGA 
GTAACCGTGATCAGATTAAGAACTGTGGATTTTTTTATGGCCACAGTGCATTTTCAGTAGCC 
CTTATTTTTGTAACTGCATTCCAGGGCCTTTCAGTGGCTTTCATTCTGAAGTTCCTGGATAA 
CATGTTCCATGTCTTGATGGCCCAGGTTACCACTGTCATTATCACAACAGTGTCTGTCCTGG 
TCTTTGACTTCAGGCCCTCCCTGGAATTTTTCTTGGAAGCCCCATCAGTCCTTCTCTCTATA 

TTTATTTATAATG C CAG C AAGC CT CAAGTT CCGG AATACG CA CCT AGG CAAGAAAGGATC CG 
AGATCTAAGTGGCAATCTTTGGGAGCGTTCCAGTGGGGATGGAGAAGAACTAGAAAGACTTA 
CCAAACCCAAGAGTGATGAGTCAGATGAAGATACTTTCTAACTGGTACCCACATAGTTTGCA 
GCTCTCTTGAACCTTATTTTCACATTTTCAGTGTTTGTAATATTTATCTTTTCACTTTGATA 
AACCAGAAATGTTTCTAAATCCTAATATTCTTTGCATATATCTAGCTACTCCCTAAATGGTT 
CCATCCAAGGCTTAGAGTACCCAAAGGCTAAGAAATTCTAAAGAACTGATACAGGAGTAACA 
ATATGAAGAATTCATTAATATCTCAGTACTTGATAAATCAGAAAGTTATATGTGCAGATTAT 
TTTCCTTGGCCTTCAAGCTTCCAAAAAACTTGTAATAATCATGTTAGCTATAGCTTGTATAT 
ACACATAGAGATCAATTTGCCAAATATTCACAATCATGTAGTTCTAGTTTACATGCCAAAGT 
CTTCCCTTTTTAACATTATAAAAGCTAGGTTGTCTCTTGAATTTTGAGGCCCTAGAGATAGT 
CATTTTGCAAGTAAAGAGCAACGGGACCCTTTCTAAAAACGTTGGTTGAAGGACCTAAATAC 
CTGGCCATACCATAGATTTGGGATGATGTAGTCTGTGCTAAATATTTTGCTGAAGAAGCAGT 
TTCTCAGACACAACATCTCAGAATTTTAATTTTTAGAAATTCATGGGAAATTGGATTTTTGT 
AATAATCTTTTGATGTTTTAAACATTGGTTCCCTAGTCACCATAGTTACCACTTGTATTTTA 
AGTCATTTAAACAAGCCACGGTGGGGCTTTTTTCTCCTCAGTTTGAGGAGAAAAATCTTGAT 
GTCATTACTCCTGAATTATTACATTTTGGAGAATAAGAGGGCATTTTATTTTATTAGTTACT 
AATTCAAGCTGTGACTATTGTATATCTTTCCAAGAGTTGAAATGCTGGCTTCAGAATCATAC 
CAGATTGTCAGTGAAGCTGATGCCTAGGAACTTTTAAAGGGATCCTTTCAAAAGGATCACTT 
AGCAAACACATGTTGACTTTTAACTGATGTATGAATATTAATACTCTAAAAATAGAAAGACC 
AGTAATAT ATAAGT C A CT TT AC AG T G CT ACTT C ACA CTT AAAAGTG CATGGTATTTTTCATG 
GTATTTTGCATGCAGCCAGTTAACTCTCGTAGATAGAGAAGTCAGGTGATAGATGATATTAA 
AAATTAGCAAACAAAAGTGACTTGCTCAGGGTCATGCAGCTGGGTGATGATAGAAGAGTGGG 
CTTTAACTGGCAGGCCTGTATGTTTACAGACTACCATACTGTAAATATGAGCTTTATGGTGT 
CATTCTCAGAAACTTATACATTTCTGCTCTCCTTTCTCCTAAGTTTCATGCAGATGAATATA 
AGGTAATATACTATTATATAATTCATTTGTGATATCCACAATAATATGACTGGCAAGAATTG 
GTGG AAATTTGT AATT AAAAT AATTATTAAAC CT 
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FIGURE 9 

MEKQCCSHPVICSLSTMYTFLLGAIFIALSSSRILLVKYSANEENKYDYLPTTVNVCSELVK 
LVFCVLVSFOTIKKDHQSRNLKYASWKEFSDFMKWSIPAFLYFLDNLIVFYVLSYLQPAMAV 
I FSNFS 1 1 TTALLFRI VLKRRLNW I QWASLLTLFLS I VALTAGTKTLQHNLAGRGFHHDAFF 
SPSNSCLLFRSECPRKDNCTAKEWTFPEAKWNTTARVFSHIRLGMGHVLIIVQCFISSMANI 
YNEKILKEGNQLTESIFIQNSKLYFFGILFNGLTLGLQRSNRDQIKNCGFFYGHSAFSVALI 
FVTAFQGLSVAF ILKFLDNMFHVXiMAQVTTVI I TTVS VLVFDFRPSLEFFLEAPSVLLS I F I 
YNASKPQVPEYAPRQERIRDLSGNLWERSSGDGEELERLTKPKSDESDEDTF 
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FIGURE 10 

CGTGCCTGCGCAATGGGTGTCGGGTCCGCTTTTTCCCAATCCGGACGTAATCGTGGTTTTTG 
TTCTGCAATAGGCGGCTTAGAGGGAGGGGCTTTTTCGCCTATACCTACTGTAGCTTCTCCAC 
GTATGGACCCTAAAGGCTACTGCTGCTACTACGGGGCTAGACAGTTACTGTCTCAGCTCTAG 
GATGTGCGTTCTTCCACTAGAAGCTCTTCTGAGGGAGGTAATTAAAAAACAGTGGAATGGAA 
AAACAGTGCTGTAGTCATCCTGTAATATGCTCCTTGTCAACAATGTATACATTCCTGCTAGG 
TGCCATATTCATTGCTTTAAGCTCAAGTCGCATCTTACTAGTGAAGTATTCTGCCAATGAAG 
AAAACAAGTATGATTATCTTCCAACTACTGTGAATGTGTGCTCAGAACTGGTGAAGCTAGTT 
TTCTGTGTGCTTGTGTCATTCTGTGTTATAAAGAAAGATCATCAAAGTAGAAATTTGAAATA 
TGCTTCCTGGAAGGAATTCTCTGATTTCATGAAGTGGTCCATTCCTGCCTTTCTTTATTTCC 
TGGATAACTTGATTGTCTTCTATGTCCTGTCCTATCTTCAACCAGCCATGGCTGTTATCTTC 
TCAAATTTTAGCATTATAACAACAGCTCTTCTATTCAGGATAGTGCTGAAGAGGCGTCTAAA 
CTGGATCCAGTGGGCTTCCCTCCTGACTTTATTTTTGTCTATTGTGGCCTTGACTGCCGGGA 
CTAAAACTTTA 
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FIGURE 11 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGGCCGGCTTGGCTAGCGCGCGGCGGCC 
GTGGCTAAGGCTGCTACGAAGCGAGCTTGGGAGGAGCAGCGGCCTGCGGGGCAGAGGAGCAT 
CCCGTCTACCAGGTCCCAAGCGGCGTGGCCCGCGGGTCATGGCCAAAGGAGAAGGCGCCGAG 
AGCC^CTCCGCGGCGGC^CTGCTACCCACCAGCATCCTCCAAAGCACTGAACGCCCGGCCCA 
GGTGAAGAAAGAACCGAAAAAGAAGAAAC^CAGTTGTCTGTTTGC^CAAGCTTTGCTATG 
CACTTGGGGGAGCCCCCTACCAGGTGACGGGCTGTGCCCTGGGTTTCTTCCTTCAGATCTAC 
CTATTGGATGTGGCTCAGGTGGGCCCTTTCTCTGCCTCCATCATCCTGTTTGTGGGCCGAGC 

CTGGGATG CCAT CACAGACC CC CTGGTGGG CCTCTGCATC AG CAAAT CCC C CTGGACCTGCC 
TGGGTCGCCTTATGCCCTGGATCATCTTCTCCACGCCCCTGGCCGTCATTGCCTACTTCCTC 

AT CTGGTTCGTG C C CGACTT C C CACACGGC CAGA CCTATTGGTAC CTGCTTTTCT ATTG CCT 
CTTTGAAACAATGGTCACGTGTTTCCATGTTCCCTACTCGGCTCTCACCATGTTCATCAGCA 
ACCGAGCAGACTGAGCGGGATTCTGCCACCGCCTATCGGATGACTGTGGAAGTGCTGGGCAC 
AGTGCTGGGCACGGCGATCCAGGGACAAATCGTGGGCCAAGCAGACACGCCTTGTTTCCAGG 
ACTTCAATAGCTCTACAGTAGCTTCACAAAGTGCCAACCATACACATGGCACCACTTCACAC 
AGGGAAACGCAAAAGGCATACCTGCTGGCAGCGGGGGTCATTGTCTGTATCTATATAATCTG 
TGCTGTCATCCTGATCCTGGGCGTGCGGGAGCAGAGAGAACCCTATGAAGCCCAGCAGTCTG 
AGCCAATCGCCTACTTCCGGGGCCTACGGCTGGTCATGAGCCACGGCCCATACATCAAACTT 
ATTACTGGCTTCCTCTTCACCTCCTTGGCTTTCATGCTGGTGGAGGGGAACTTTGTCTTGTT 
TTGCACCTACACCTTGGGCTTCCGCAATGAATTCCAGAATCTACTCCTGGCCATCATGCTCT 
CGGCCACTTTAACCATTCCCATCTGGCAGTGGTTCTTGACCCGGTTTGGCAAGAAGACAGCT 
GTATATGTTGGGATCTCATCAGCAGTGCCATTTCTCATCTTGGTGGCCCTCATGGAGAGTAA 
CCTCATCATTACATATGCGGTAGCTGTGGCAGCTGGCATCAGTGTGGCAGCTGCCTTCTTAC 
TACCCTGGTCCATGCTGCCTGATGTCATTGACGACTTCCATCTGAAGCAGCCCCACTTCCAT 
GGAACCGAGCCCATCTTCTTCTCCTTCTATGTCTTCTTCACCAAGTTTGCCTCTGGAGTGTC 
ACTGGGCATTTCTACCCTCAGTCTGGACTTTGCAGGGTACCAGACCCGTGGCTGCTCGCAGC 
CGGAACGTGTCAAGTTTACACTGAACATGCTCGTGACCATGGCTCCCATAGTTCTCATCCTG 
CTGGGCCTGCTGCTCTTCAAAATGTACCCCATTGATGAGGAGAGGCGGCGGCAGAATAAGAA 
GGCCCTGCAGGCACTGAGGGACGAGGCCAGCAGCTCTGGCTGCTCAGAAACAGACTCCACAG 
AGCTGGCTAGCATCCTCTAGGGCCCGCCACGTTGCCCGAAGCCACCATGCAGAAGGCCACAG 
AAGGGATCAGGACCTGTCTGCCGGCTTGCTGAGCAGCTGGACTGCAGGTGCTAGGAAGGGAA 
CTGAAGACTCAAGGAGGTGGCCCAGGACACTTGCTGTGCTCACTGTGGGGCCGGCTGCTCTG 
TGGCCTCCTGCCTCCCCTCTGCCTGCCTGTGGGGCCAAGCCCTGGGGCTGCCACTGTGAATA 
TGCCAAGGACTGATCGGGCCTAGCCCGGAACACTAATGTAGAAACCTTTTTTTTACAGAGCC 
TAATTAATAACTTAATGACTGTGTACATAGCAATGTGTGTGTATGTATATGTCTGTGAGCTA 

TT AATGTT ATTAATT TT CAT AAAAG CTGG AAAG C 
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FIGURE 12 

MWLRWALSLPPSSCLWAEPGMPSQTPWWASASANPPGPAWVALCPGSSSPRPWPSLPTSSSG 
SCPTSHTARPIGTCFSIASLKQWSRVSMFPTRLSPCSSATEQTERDSATAYRMTVEVLGTVL 
GTAIQGQIVGQADTPCFQDFNSSTVASQSANHTHGTTSHRETQKAYLLAAGVIVCIYIICAV 
I L I L G VRE QRE P YEAQQS E P I AY F RGLRLVMS HG PYIKLITGFLFTS LAFMLVE GNFVLF CT 
YTLG FRNE FQNLLLA I MLS ATLT I P I WQWFLTRFGKKTAVYVG I S S AVP FL I LVALMESNL I 
ITYAVAVAAGISVAAAFLLPWSMLPDVIDDFHLKQPHFHGTEPIFFSFYVFFTKFASGVSLG 
ISTLSLDFAGYQTRGCSQPERVKFTLNMLVTMAPIVLILLGLLLFKMYPIDEERRRQNKKAL 
QALRDEASSSGCSETDSTELASIL 
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FIGURE 13 

GGGAAACGCAAAAGGCATACCTGCTGGCAGCGGGGGTCATTGTCTGTATCTATATAATCTGT 
GCTGTCATCCTGATCCTGGGCGTGCGGGAGCAGAGAGAACCCTATGAAGCCCAGCAGTCTGA 
GCCAATCGCCTACTTCCGGGGCCTACGGCTGGTCATGAGCCACGGCCCATACATCAAACTTA 
TTACTGGCTTCCTCTTCACCTCCTTGGCTTTCATGCTGGTGGAGGGGAACTTTGTCTTGTTT 
TGCACCTACACCTTGGGCTTCCGCAATGAATTCCAGAATCTACTCCTGGCCATCATGCTCTC 
GGCCACTTTAACCATTCCCATCTGGCAGTGGTTCTTGACCCGGTTTGGCAAGAAGACAGCTG 
TATATGTTGGGATCTCATCAGCAGTGCCATTTCTCATCTTGGTGGCCCTCATGGAGAGTAAC 
CTCATCATTACATATGCGGTAGCTGTGGCAGCTGGCATCAGTGTGGCAGCTGCCTTCTTACT 
ACCCTGGTCCATGCTGCCTGATGTCATTGACGACTTCCATCTGAAGCAGCCCCACTTCCATG 
GAACCGAGCCCAT 
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GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGT 
ATGAGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGT 
GGTGGTTTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCT 
TTCATATTTTCATACATTACTGCAGTAACACTCCACCATATAGACCCGGCTTTACCTTATAT 
CAGTGACACTGGTACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGG 
CAGTTTTATGCATTGCTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAA 
GAGAACGTTATCATCAAATTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGG 
ACTTTCTATTGTGGCAAACTTCCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTG 
TGCTTACCTTTGGTATGGGCTCATTATATATGTTTGTTCAGACCATCCTTTCCTACCAAATG 
CAGCCCAAAATCCATGGCAAACAAGTCTTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGG 
AGTAAGTGCACTTAGCATGCTGACTTGCTCATCAGTTTTGCACAGTGGCAATTTTGGGACTG 
ATTTAGAACAGAAACTCCATTGGAACCCCGAGGACAAAGGTTATGTGCTTCACATGATCACT 
ACTG C AG C AGAATGGT CTATGT CATTTT CCTT CT TTGG TT TT TT C CTG ACTT ACAT T CGTG A 
TTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACATGGATTAACCCTCTATGACACTG 
CACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCAGAGATATTTGATGAAAGGAT 
AAAATATTTCTGTAATGATTATGATTCTCAGGGATTGGGGAAAGGTTCACAGAAGTTGCTTA 
TT CTTCTCTGAAATTTT CAACC ACTTAATCAAGG CT GA CAGT AACACTGATG AATG CTGATA 
ATCAGGAAACATGAAAGAAGCCATTTGATAGATTATTCTAAAGGATATCATCAAGAAGACTA 
T T AAAAACACCTAT G CCTAT ACTTT TTT AT CT CAG AAAATAAAGT CAAAAG ACTATG 
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FIGURE 15 

MWWFQQGLS FLPSALVI WTSAAF I FS YI TAVTLHH I DPAL PY I S DTGTVAPEKCLFGAMLNI 
AAVLCIATIYVRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSG 
AVLTFGMGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFG 
TDLEQKLHWNPEDKGYVLHM I TTAAEWSMS FS FFGFFLTYI RDFQKI S LRVEANLHGLTLYD 
TAPCP INNERTRLLSRD I 
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FIGURE 16 

CGGACGCTTGGGCNGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGTGCCTGATGCCGAGT 
TCCGTCTCTCGGGTCTTTTCCTGGTCCCAGGCAAAGCGGAGCGGAGATCCTCAAACGGCCTA 
GTGCTTCGCGCTTCCGGAGAAAATCAGCGGTCTAATTAATTCCTCTGGTTTGTTGAAGCAGT 
TACCAAGAATCTTCAACCCTTTCCCACAAAAGCTAATTGAGTACACGTTCCTGTTGAGTACA 
CGTTCCTGTTGATTTACAAAAGGTGCAGGTATGAGCAGGTCTGAAGACTAACATTTTGTGAA 
GTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGTTTCAGCAAGGCCTCAGTTTCCTTCCT 
TCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTTTCATACATTACTGCAGTAACACT 
CCACCATATAGACCCGGCTTTACCTTATATCAGTGACACTGGTACAGTANC 
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FIGURE 17 

CCCACGCGTCCGCCCGCCGCTGCGTCCCGGAGTGCAAGTGAGCTTCTCGGCTGCCCCGCGGG 

CCGGGGTGCGGAGCCGACATGCGCCCGCTTCTCGGCCTCCTTCTGGTCTTCGCCGGCTGCAC 

CTTCGCCTTGTACTTGCTGTCGACGCGACTGCCCCGCGGGCGGAGACTGGGCTCCACCGAGG 

AGGCTGGAGGCAGGTCGCTGTGGTTCCCCTCCGACCTGGCAGAGCTGCGGGAGCTCTCTGAG 

GTCCTTCGAGAGTACCGGAAGGAGCACCAGGCCTACGTGTTCCTGCTCTTCTGCGGCGCCTA 

CCTCTACAAACAGGGCTTTGCCATCCCCGGCTCCAGCTTCCTGAATGTTTTAGCTGGTGCCT 

TGTTTGGGCCATGGCTGGGGCTTCTGCTGTGCTGTGTGTTGACCTCGGTGGGTGCCACATGC 

TGCTACCTGCTCTCCAGTATTTTTGGCAAACAGTTGGTGGTGTCCTACTTTCCTGATAAAGT 

GG CCCTGCTGCAGAGAAAGGTGGAGGAGAACAGAAACAGCTTGTTTTTTTTCTTATTGTTTT 

TGAGACTTTTCCCCATGACACCAAACTGGTTCTTGAACCTCTCGGCCCCAATTCTGAACATT 

CCCATCGTGCAGTTCTTCTTCTCAGTTCTTATCGGTTTGATCCCATATAATTTCATCTGTGT 

GCAGACAGGGTCCATCCTGTCAACCCTAACCTCTCTGGATGCTCTTTTCTCCTGGGACACTG 

TCTTTAAGCTGTTGGCCATTGCCATGGTGGCATTAATTCCTGGAACCCTCATTAAAAAATTT 

AGTCAGAAACATCTGCAATTGAATGAAACAAGTACTGCTAATCATATACACAGTAGAAAAGA 

CACATGATCTGGATTTTCTGTTTGCCACATCCCTGGACTCAGTTGCTTATTTGTGTAATGGA 

TGTGGTCCTCTAAAGCCCCTCATTGTTTTTGATTGCCTTCTATAGGTGATGTGGACACTGTG 

CATCAATGTGCAGTGTCTTTTCAGAAAGGACACTCTGCTCTTGAAGGTGTATTACATCAGGT 

TTTCAAACCAGCCCTGGTGTAGCAGACACTGCAACAGATGCCTCCTAGAAAATGCTGTTTGT 

GGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCCGGTGATTC 

ACAAGGTCAGGAGTTCAAGACCAGCCTGGCCAAGATGGTGAAATCCTGTCTCTAATAAAAAT 

ACAAAAATTAGCCAGGCGTGGTGGCAGGCACCTGTAATCCCAGCTACTCGGGAGGCTGAGGC 

AGGAGAATTGCTTGAACCAAGGTGGCAGAGGTTGCAGTAAGCCAAGATCACACCACTGCACT 

CCAGCCTGGGTGATAGAGTGAGACACTGTCTTGAC 
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FIGURE 18 

MRPLLGLLLVFAGCTFALYLLSTRLPRGRRLGSTEEAGGRSLWFPSDLAELRELSEVLREYR 
KEHQAYVFLLFCGAYLYKQGFAIPGSSFLNVLAGALFGPWLGLLLCCVLTSVGATCCYLLSS 
IFGKQLWSYFPDICVALLQRKVEENRNSLFFFLLFLRLFPMTPNWFLNLSAPILNIPIVQFF 
FSVLIGLIPYNFICVQTGSILSTLTSLDALFSWDTVFKLLAIAMVALIPGTLIKKFSQKHLQ 
LNETSTANHIHSRKDT 
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FIGURE 19 

CCGAGGCGGGAGGAGCCCGAGGGGGCGCGAGCCCCGCATGAATCATTGTAGTCAATCATTTT 

CCAGTTCTCAGCCGCTCAGTTGTGATCAAGGGACACGTGGTTTCCGAACTGCCAGCTCAGAA 

TAGGAAAATAACTTGGGATTTTATATTGGAAGACATGGATCTTGCTGCCAACGAGATCAGCA 

TTTATGACAAACTTTCAGAGACTGTTGATTTGGTGAGACAGACCGGCCATCAGTGTGGCATG 

TCAGAGAAGGCAATTGAAAAATTTATCAGACAGCTGCTGGAAAAGAATGAACCTCAGAGACC 

CCCCCCGCAGTATCCTCTCCTTATAGTTGTGTATAAGGTTCTCGCAACCTTGGGATTAATCT 

TGCTCACTGCCTACTTTGTGATTCAACCTTTCAGCCCATTAGCACCTGAGCCAGTGCTTTCT 

GGAGCTCACACCTGGCGCTCACTCATCCATC^CATTAGGCTGATGTCCTTGCCCATTGCCAA 

GAAGTACATGTCAGAAAATAAGGGAGTT C CTCTGCATGGGGGTGATGAAGACAGAC CCTTTC 

CAGACTTTGACCCCTGGTGGACAAACGACTGTGAGCAGAATGAGTCAGAGCCCATTCCTGCC 

AACTGCACTGGCTGTGCCCAGAAACACCTGAAGGTGATGCTCCTGGAAGACGCCCCAAGGAA 

ATTTGAGAGGCTCCATCCACTGGTGATCAAGACGGGAAAGCCCCTGTTGGAGGAAGAGATTC 

AGCATTTTTTGTGCCAGTACCCTGAGGCGACAGAAGGCTTCTCTGAAGGGTTTTTCGCCAAG 

TGGTGGCGCTGCTTTCCTGAGCGGTGGTTCCCATTTCCTTATCCATGGAGGAGACCTCTGAA 

CAGATCACAAATGTTACGTGAGCTTTTTCCTGTTTTCACTCACCTGCCATTTCCAAAAGATG 

CCTCTTTAAACAAGTGCTCCTTTCTTCACCCAGAACCTGTTGTGGGGAGTAAGATGCATAAG 

ATGCCTGACCTATTTATCATTGGCAGCGGTGAGGCCATGTTGCAGCTCATCCCTCCCTTCCA 

GTGCCGAAGACATTGTCAGTCTGTGGCCATGCCAATAGAGCCAGGGGATATCGGCTATGTCG 

ACACCACCCACTGGAAGGTCTACGTTATAGCCAGAGGGGTCCAGCCTTTGGTCATCTGCGAT 

GGAACCGCTTTCTCAGAACTGTAGGAAATAGAACTGTGCACAGGAACAGCTTCCAGAGCCGA 

AAACCAGGTTGAAAGGGGAAAAATAAAAACAAAAACGATGAAACTGCAAAAA 
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FIGURE 20 

MDLAANEISIYDKIjSETVDLVRQT 

KAaJVTLGLILLTAYFVIQPFSPIAPEPVLSGAHTTOSLIHKIRLMSLPIAKKYMSENKGVPL 
HGGDEDRPFPDFDPVWTNDCEQNESEPIPANCTGC^QKHLK^MLLEDAPRKFERLHPLVIKT 
GKPLLEEEIQHFLCQYPEATEGFSEGFFAKWWRCFPERWFPFPYPWRRPLNRSQMLRELFPV 
FTHLPFPKDASLNKCSFLHPEPWGSKMHKMPDLFI IGSGEAMLQLIPPFQCRRHCQSVAMP 
I EPGD I GYVDTTHWKVYVI ARGVQPLVI CDGTAFSEL 
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FIGURE 21 

CCACGGTGTCCGTTCTTCGCCCGGCGGCAGCTGTCCCCGAGGCGGGAGGAGCCCGAGGGGCG 
CGAGCCCCGCATGAATCATTGTAGTCAATCATTTTCCAGTTCTCAGCCGTTCAGTTGTGATC 
AAGGGACACGTGGTTTCCGAACTGCCAGCTCAGAATAGGAAAATAACTTGGGATTTTATATT 
GGAAGACATGGATCTTGCTGCCAACGAGATCAGCATTTATGACAAACTTTCAGAGACTGTTG 
ATTTGGTGAGACAGACCGGCCATCAGTGTGGCATGTCAGAGAAGGCAATTGAAAAATTTATC 
AGACAGCTGCTGGAAAAGAATGAACCTCAGAGACCCCCCCCGCAGTATCCTCTCCTTATAGT 
TGTGTATAAGGTTCTCGCAACCTTGGGATTAATCTTGCTCACTGCCTACTTTGTGATTCAAC 
CTTTCAGCCCATTAGCACCTGAGCCAGTGCTTTGTGGAGCTCAC 
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FIGURE 22 

CCCACGCGTCCGCCCACGCGTCCGGCTGAACACCTCTTCTTTGGAGTCAGCCACTGATGAGG 

CAGGGTCCCCACTTGCAGCTGCAGCAGCTGCAGCAGCTGCAGAGCGCTGCTCCTGGCTGGTG 

CCACTGGTGCGCACGCTGCTAGACCGTGCCTATGAGCCGCTGGGGCTGCAGTGGGGACTGCC 

CTCCCTGCCACCCACCAATGGCAGCCCCACCTTCTTTGAAGACTTCCAGGCTTTTTGTGCCA 

CACCCGAATGGCGCCACTTCATCGACAAACAGGTACAGCCAACCATGTCCCAGTTCGAAATG 

GACACGTATGCTAAGAGCCACGACCTTATGTCAGGTTTCTGGAATGCCTGCTATGACATGCT 

TATGAGCAGTGGGCAGCGGCGCCAGTGGGAGCGCGCCCAGAGTCGTCGGGCCTTCCAGGAGC 

TGGTGCTGGAACCTGCGCAGAGGCGGGCGCGCCTGGAGGGGCTACGCTACACGGCAGTGCTG 

AAGCAGCAGGCAACGCAGCACTCCATGGCCCTGCTGCACTGGGGGGCGCTGTGGCGCCAGCT 

CGCCAGCCCATGTGGGGCCTGGGCGCTGAGGGACACTCCCATCCCCCGCTGGAAACTGTCCA 

GCGCCGAGACATATTCACGCATGCGTCTGAAGCTGGTGCCCAACCATCACTTCGACCCTCAC 

CTGGAAGCCAGCGCTCTCCGAGACAATCTGGGTGAGGTTCCCCTGACACCCACCGAGGAGGC 

CTCACTGCCTCTGGCAGTGACCAAAGAGGCCAAAGTGAGCACCCCACCCGAGTTGCTGCAGG 

AGGACCAGCTCGGCGAGGACGAGCTGGCTGAGCTGGAGACCCCGATGGAGGCAGCAGAACTG 

GATGAGCAGCGTGAGAAGCTGGTGCTGTCGGCCGAGTGCCAGCTGGTGACGGTAGTGGCCGT 

GGTCCCAGGGCTGCTGGAGGTCACCACACAGAATGTATACTTCTACGATGGCAGCACTGAGC 

GCGTGGAAACCGAGGAGGGCATCGGCTATGATTTCCGGCGCCCACTGGCCCAGCTGCGTGAG 

GTCCACCTGCGGCGTTTCAACCTGCGCCGTTCAGCACTTGAGCTCTTCTTTATCGATCAGGC 

CAACTACTTCCTCAACTTCCCATGCAAGGTGGGCACGACCCCAGTCTCATCTCCTAGCCAGA 

CTCCGAGACCCCAGCCTGGCCCCATCCCACCCCATACCCAGGTACGGAACCAGGTGTACTCG 

TGGCTCCTGCGCCTACGGCCCCCCTCTCAAGGCTACCTAAGCAGCCGCTCCCCCCAGGAGAT 

GCTGCGTGCCTCAGGCCTTACCCAGAAATGGGTACAGCGTGAGATATCCAACTTCGAGTACT 

TGATGCT^CTCAACACCATTGCGGGGCGGACCTACAATGACCTGTCTCAGTACCCTGTGTTC 

CCCTGGGTCCTGCAGGACTACGTGTCCCCAACCCTGGACCTCAGCAACCCAGCCGTCTTCCG 

GGACCTGTCTAAGCCCATCGGTGTGGTGAACCCCAAGCATGCCCAGCTCGTGAGGGAGAAGT 

ATGAAAGCTTTGAGGACCCAGCAGGGACCATTGACAAGTTCCACTATGGCACCCACTACTCC 

AATGCAGCAGGCGTGATGCACTACCTCATCCGCGTGGAGCCCTTCACCTCCCTGCACGTCCA 

GCTGCAAAGTGGCCGCTTTGACTGCTCCGACCGGCAGTTCCACTCGGTGGCGGCAGCCTGGC 

AGGCACGCCTGGAGAGCCCTGCCGATGTGAAGGAGCTCATCCCGGAATTCTTCTACTTTCCT 

GACTTCCTGGAGAACCAGAACGGTTTTGACCTGGGCTGTCTCCAGCTGACCAACGAGAAGGT 

AGGCGATGTGGTGCTACCCCCGTGGGCCAGCTCTCCTGAGGACTTCATCCAGCAGCACCGCC 

AGGCTCTGGAGTCGGAGTATGTGTCTGCACACCTACACGAGTGGATCGACCTCATCTTTGGC 

TACAAGCAGCGGGGGCCAGCCGGCGAGGAGGCCCTCAATGTCTTCTATTACTGCACCTATGA 

GGGGG CTGTAGACCTGGACCATGTGACAGATGAGCGGGAACGGAAGGCTCTGGAGGGCATTA 

TCAGCAACTTTGGGCAGACTCCCTGTCAGCTGCTGAAGGAGCCACATCCAACTCGGCTCTCA 

GCTGAGGAAGCAGCCCATCGCCTTGCACGCCTGGACACTAACTCACCTAGCATCTTCCAGCA 

CCTGGACGAACTCAAGGCATTCTTCGCAGAGGTGACTGTGAGTGCCAGTGGGCTGCTGGGCA 

CCCACAGCTGGTTGCCCTATGACCGCAACATAAGCAACTACTTCAGCTTCAGCAAAGACCCC 

ACCATGGGCAGCCACAAGACGCAGCGACTGCTGAGTGGCCCGTGGGTGCCAGGCAGTGGTGT 

GAGTGGACAAGCACTGGCAGTGGCCCCGGATGGAAAGCTGCTATTCAGCGGTGGCCACTGGG 

ATGGCAGCCTGCGGGTGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGCTGCCAC 

CTTGATGTAGTAACCTGCCTTGCACTGGACACCTGTGGCATCTACCTCATCTCAGGCTCCCG 

GGACACCACGTGCATGGTGTGGCGGCTCCTGCATCAGGGTGGTCTGTCAGTAGGCCTGGCAC 

CAAAGCCTGTGCAGGTCCTGTATGGGCATGGGGCTGCAGTGAGCTGTGTGGCCATCAGCACT 

GAACTTGACATGGCTGTGTCTGGATCTGAGGATGGAACTGTGATCATACACACTGTACGCCG 

CGGACAGTTTGTAGCGGCACTACGGCCTCTGGGTGCCACATTCCCTGGACCTATTTTCCACC 

TGGCATTGGGGTCCGAAGGCCAGATTGTGGTACAGAGCTCAGCGTGGGAACGTCCTGGGGCC 

CAGGTCACCTACTCCTTGCACCTGTATTCAGTCAATGGGAAGTTGCGGGCTTCACTGCCCCT 

GGCAGAGCAGCCTACAGCCCTGACGGTGACAGAGGACTTTGTGTTGCTGGGCACCGCCCAGT 

GCGCCCTGCACATCCTCCAACTAAACACACTGCTCCCGGCCGCGCCTCCCTTGCCCATGAAG 

GTGGCCATCCGCAGCGTGGCCGTGACCAAGGAGCGCAGCCACGTGCTGGTGGGCCTGGAGGA 

TGGCAAGCTCATCGTGGTGGTCGCGGGGCAGCCCTCTGAGGTGCGCAGCAGCCAGTTCGCGC 

GGAAGCTGTGGCGGTCCTCGCGGCGCATCTCCCAGGTGTCCTCGGGAGAGACGGAATACAAC 

CCTACTGAGGCGCGCTGAACCTGGCCAGTCCGGCTGCTCGGGCCCCGCCCCCGGCAGGCCTG 

GCCCGGGAGGCCCCGCCCAGAAGTCGGCGGGAACACCCCGGGGTGGGCAGCCCAGGGGGTGA 

GCGGGGCCCACCCTGCCCAGCTCAGGGATTGGCGGGCGATGTTACCCCCTCAGGGATTGGCG 

GGCGGAAGTCCCGCCCCTCGCCGGCTGAGGGGCCGCCCTGAGGGCCAGCACTGGCGTCT 
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FIGURE 23 

MSQFEMDTYAKSHDLMSGI^ACYDMLMSSGQRRQWERAQSRRAFQELVLEPAQRRARLEGL 

RYTAVLKQQATQHSMALLHWGALWRQLASPCGAWALRDTP I PRWKLSSAETYSRMRLKLVPN 

HHFDPHLEASALRDNLGEVPLTPTEEASLPI^VTKEAKVSTPPELLQEDQLGEDELAELETP 

MEAAELDEQREICLVLSAECQLVTVVAVVPGLLEVTTQNVYFYDGSTERVETEEGIGYDFRRP 

LAQLREVHLRRFNLRRSALELFFIDQANYFLNFPCKVGTTPVSSPSQTPRPQPGPIPPHTQV 

RNQVYSWLLRLRPPSQGYLSSRSPQEMLRASGLTQKWVQREISNFEYLMQLNTIAGRTYNDL 

S Q Y P VF PWVLQD YVS P TLDLSNP AVFRDLS KP I GWNPKHAQLVREKYES FEDPAGT I DKFH 

YGTHYSNAAGVMHYLIRVEPFTSLHVQLQSGRFDCSDRQFHSVAAAWQARLESPADVKELIP 

EFFYFPDFLENQNGFDLGCLQLTNEKVGDWLPPWASSPEDFIQQHRQALESEYVSAHLHEW 

I DL I FGYKQRGPAAEEALNVFYYCTYEGAVDLDHVTDERERKALEG 1 1 SNFGQTPCQLLKEP 

HPTRLSAEEAAHRLARLDTNSPSIFQHLDELKAFFAEVTVSASGLLGTHSWLPYDRNISNYF 

SFSKI)PTMGSHKTQRLLSGPWPGSGVSGQALAVAPDGKLLFSGGHWDGSLRVTALPRGKLL 

SQLSCHLDVVTCLALDTCGIYLISGSRDTTCMVWRLLHQGGLSVGLAPKPVQVLYGHGAAVS 

CVAISTELDMAVSGSEDGTVIIHTVRRGQFVAALRPLGATFPGPIFHLALGSEGQIVVQSSA 

WERPGAQVTYSLHLYSVNGKLRASLPLAEQPTALTVTEDFVLLGTAQCALHILQLNTLLPAA 

PPLPMKVAIRSVAVTKERSHVLVGLEDGKLIVWAGQPSEVRSSQFARKLWRSSRRISQVSS 

GETEYNPTEAR 
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FIGURE 24 

CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCC 

CACGGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCAT 

CCAAAGGCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTC 

TGGACCCTTAACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTT 

CTACTGGGCCTTCCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCC 

GCACACTCCGTTACCACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAG 

ATAG CC CGGG TC AT CT TGG AGT AT AT TG AC CAC AAG CT CAG AGGAGTG CAGAAC CCTGTAG C 

CCGCTGCATCATGTGCTGTTTCAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCC 

TAAACCGCAATGCATACATCATGATCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAA 

AATGCGTTCATGCTACTCATGCGAAACATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGA 

CCTGCTGCTGTTCTTTGGGAAGCTGCTGGTGGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTT 

TTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAGACTTTAAGAGCCCCCACCTCAACTATTAC 

TGGCTGCCCATCATGACCTCCATCCTGGGGGCCTATGTCATCGCCAGCGGCTTCTTCAGCGT 

TTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCTGGAAGACCTGGAGCGGAACAACG 

GCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAAAGATTCTGGGCAAGAAGAAC 

GAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGGCCCTGATCCAGGACTGC 

ACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGTCTCCATTTTGTGGT 

AAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACTTTGAGAGGCTG" 

AGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCTCC 

GTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCAGCTAC 

TCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGAGA 

TCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAA 

AAAGATTTTATTAAAGATATTTTGTTAACT C 
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FIGURE 25 

RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLF 
WTLNWVLALGQCVLAGAFAS FYWAFHKPQD I PTFPL I S AF I RTLRYHTGS LAFGALILTLVQ 
IARVILEYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAK 
NAFMLLMRNIVRVVVLDKOTDLLLFFGKL^^ 

WLPIMTSILGAYVIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKN 
EAPPDNKKRKK 



WO 99/63088 _g {fa 0 PCT/US99/12252 

FIGURE 26 

GAGTCTTGACCGCCGCCGGGCTCTTGGTACCTCAGCGCGAGCGCCAGGCGTCCGGCCGCCGT 
GGCTATGTTCGTGTCCGATTTCCGCAAAGAGTTCTACGAGGTGGTCCAGAGCCAGAGGGTCC 
TTCTCTTCGTGGCCTCGGACGTGGATGCTCTGTGTGCGTGCAAGATCCTTCAGGCCTTGTTC 
CAGTGTGACCACGTGCAATATACGCTGGTTCCAGTTTCTGGGTGGCAAGAACTTGAAACTGC 
ATTTCTTGAGCATAAAGAACAGTTTCATTATTTTATTCTCATAAACTGTGGAGCTAATGTAG 
ACCTATTGGATATTCTTCAACCTGATGAAGACACTATATTCTTTGTGTGTGACTCCCATAGG 
CCAGTCAATGTCGTCAATGTATACAACGATACCCAGATCAAATTACTCATTAAACAAGATGA 
TGACCTTGAAGTTCCCGCCTATGAAGACATCTTCAGGGATGAAGAGGAGGATGAAGAGCATT 

CAGG AAATGACAGTGATGGGTC AGAG CCTT CTGAGAAG CG CACACGGTTAGAAG AGGAGATA 
GTGGAGCAAACCATGCGGAGGAGGCAGCGGCGAGAGTGGGAGGCCCGGAGAAGAGACATCCT 
CTTTGACTACGAGCAGTATGAATATCATGGGACATCGTCAGCCATGGTGATGTTTGAGCTGG 
CTTGGATGCTGTCCAAGGACCTGAATGACATGCTGTGGTGGGCCATCGTTGGACTAACAGAC 
CAGTGGGTGCAAGACAAGATCACTCAAATGAAATACGTGACTGATGTTGGTGTCCTGCAGCG 
CCACGTTTCCCGCCACAACCACCGGAACGAGGATGAGGAGAACACACTCTCCGTGGACTGCA 
CACGGATCTCCTTTGAGTATGACCTCCGCCTGGTGCTCTACCAGCACTGGTCCCTCCATGAC 
AGCCTGTGCAACACCAGCTATACCGCAGCCAGGTTCAAGCTGTGGTCTGTGCATGGACAGAA 
GCGGCTCCAGGAGTTCCTTGCAGACATGGGTCTTCCCCTGAAGCAGGTGAAGCAGAAGTTCC 

AG G C C ATG GA CAT C T C CT T G AAG G AG AAT TTG CGGG AAAT G ATTGAAG AGTCTGCAAAT AAA 
TTTGGGATGAAGGACATGCGCGTGCAGACTTTCAGCATTCATTTTGGGTTCAAGCACAAGTT 
TCTGGCCAGCGACGTGGTCTTTGCCACCATGTCTTTGATGGAGAGCCCCGAGAAGGATGGCT 
CAGGGACAGATCACTTCATCCAGGCTCTGGACAGCCTCTCCAGGAGTAACCTGGACAAGCTG 
TACCATGGCCTGGAACTCGCCAAGAAGCAGCTGCGAGCCACCCAGCAGACCATTGCCAGCTGC 
CTTTGCACCAACCTCGTCATCTCCCAGGGGCCTTTCCTGTACTGCTCTCTCATGGAGGGCAC 
TC CAGATGTCATGCTGTTCTCT AGGC CGGCATCC CTAAGC CT GCTCAG CAAACACCTG CTCA 
AGTCCTTTGTGTGTTCGACAAAGAACCGGCGCTGCAAACTGCTGCCCCTGGTGATGGCTGCC 
CCCCTGAGCATGGAGCATGGCACAGTGACCGTGGTGGGCATCCCCCCAGAGACCGACAGCTC 
GGACAGGAAGAACTTTTTTGGGAGGGCGTTTGAGAAGGCAGCGGAAAGCACCAGCTCCCGGA 
TGCTGCACAACCATTTTGACCTCTCAGTAATTGAGCTGAAAGCTGAGGATCGGAGCAAGTTT 
CTGGACGCACTTATTTCCCTCCTGTCCTAGGAATTTGATTCTTCCAGAATGACCTTCTTATT 
TATGTAACTGGCTTTCATTTAGATTGTAAGTTATGGACATGATTTGAGATGTAGAAGCCATT 
TTTT ATT AAAT AAAATG CTTATTTT AGG AAA 
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FIGURE 27 

MFVSDFRKEFYEWQSQRVLLFVASDVDALCACKILQALFQCDHVQYTLVPVSGWQELETAF 
LEHKEQFHYFILINCGANVDLLDILQPDEDTIFFVCDSHRPVNWNVYNDTQIKLLIKQDDD 
LEVPAYEDIFRDEEEDEEHSGNDSDGSEPSEKRTRLEEEIVEQTMRRRQRREWEARRRDILF 
DYEQ^TYHGTSSAMVMFELAWMLSKDL^DMLWWAIVGLTDQWQDKITQMKYVTDVGVLQ^ 
VSRHNHRNEDEENTLSVDCTRISFEYDLRLVLYQHWSLHDSLCOTSYTAARFKLWSVHGQKR 
LQEFLADMGLPLKQVKQKFQAMDISLKENLREMIEESANKFGMKDMRVQTFSIHFGFKHKFL 
ASDWFATMSIMESPEKDGSGTDHFIQALDSLSRSNLDKLYHGLELAKKQLRATQQTIASCL 
CTNLVISQGPFLYCSLMEGTPDVMLFSRPASLSLLSKHLLKSFVCSTKNRRCKLLPLVMAAP 
LSMEHGTVTVVGIPPETDSSDRKNFFGRAFEKAAESTSSRMLHNHFDLSVIELKAEDRSKFL 
DALISLLS 
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FIGURE 28 

GTACCTCAGCGCGAGCGCCAGGCGTCCGGCCGCCGTGGCTATGNTCGTGTCCGATTTCCGCA 
AAGAGTTCTACGAGGTGGTCCAGAGCCAGAGGGTCCTTCTCTTCGTGGCCTCGGANGTGGAT 
GCTCTGTGTGCGTGCAAGATCCTTCAGGCCTTGTTCCAGTGTGACCANGTGCAATATANGCT 
GGTTCCAGTTTCTGGGTGGCAAGAACTTGAAACTGCATTTCTTGAGCATAAAGAACAGTTTC 
ATTATTTTATTCTCATAAACTGTGGAGCTAATGTAGACCTATTGGATATTCTTCAACCTGAT 
GAAGACACTATATTCTTTGTGTGTGACACCCATAGGCCAGTCAATGTTGTCAATGTATACAA 
CGATACCC 
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FIGURE 29 

CAGGAACCCTCTCTTTGGGTCTGGATTGGGACCCCTTTCCAGTACCATTTTTTCTAGTGAAC 
CACGAAGGGACGATACCAGAAAACACCCTCAACCCAAAGGAAATAGACTACAGCCCCAATTG 
GCTGACTTTGGCTATAGAAAAAAGAAAGGAACGAAAAGAGACAGTTTTTTTTGGAAAGCTAA 
GTCTTCCCTTTATCGAGTCAAGAAACCCCCCCTTCTTGAGCTATTTACAGCTTTTAACAATT 
GAGTAAAGTACGCTCCGGTCACCAIGGTGACAGCCGCCCTGGGTCCCGTCTGGGCAGCGCTC 
CTGCTCTTTCTCCTGATGTGTGAGATCCGTATGGTGGAGCTCACCTTTGACAGAGCTGTGGC 
CAGCGGCTGCCAACGGTGCTGTGACTCTGAGGACCCCCTGGATCCTGCCCATGTATCCTCAG 
CCTCTTCCTCCGGCCGCCCCCACGCCCTGCCTGAGATCAGACCCTACATTAATATCACCATC 
CTGAAGGGTGACAAAGGGGACCCAGGCCCAATGGGCCTGCCAGGGTACATGGGCAGGGAGGG 
TCCCCAAGGGGAGCCTGGCCCTCAGGGCAGCAAGGGTGACAAGGGGGAGATGGGCAGCCCCG 
GCGCCCCGTGCCAGAAGCGCTTCTTCGCCTTCTCAGTGGGCCGCAAGACGGCCCTGCACAGC 
GGCGAGGACTTCCAGACGCTGCTCTTCGAAAGGGTCTTTGTGAACCTTGATGGGTGCTTTGA 
CATGGCGACCGGCCAGTTTGCTGCTCCCCTGCGTGGCATCTACTTCTTCAGCCTCAATGTGC 
ACAG CTGGAATTACAAGGAGACGTACGTGCACATTATG CATAAC CAGAAAGAGG CTGTCATC 
CTGTACGCGCAGCCCAGCGAGCGCAGCATCATGCAGAGCCAGAGTGTGATGCTGGACCTGGC 
CTACGGGGACCGCGTCTGGGTGCGGCTCTTCAAGCGCCAGCGCGAGAACGCCATCTACAGCA 
ACGACTTCGACACCTACATCACCTTCAGCGGCCACCTCATCAAGGCCGAGGACGACTG^GGG 
CCTCTGGGCCACCCTCCCGGCTGGAGAGCTCAGGTGCTGGTCCCGTCCCCTGCAGGGCTCAG 
TTTGCACTGCTGTGAAGCAGGAAGGCCAGGGAGGTCCCCGGGGACCTGGCATTCTGGGGAGA 
CCCTGCTTCTATCTTGGCTGCCATCATCCCTCCCAGCCTATTTCTGCTCCTCTCTTCTCTCT 
TGGACCTATTTTAAGAAGCTTGCTAACCTAAATATTCTAGAACTTTCCCAGCCTCGTAGCCC 
AGCACTTCTCAAACTTGGAAATGCATGCGAATCACCCGGGGTTCGTGTTAAATGCAGATTCT 
GACTCAGCAGGTCTGAGTGGGTCCAGGATTCTGTGTTTCTCATATGTTCCTGGGTGATGCTG 
ATGGGGTCAGTCTATGAACCACACTGGAGCAACCAGGTTCTAGGACTTTCTCAATATTCTAG 
TACTTTCTGAACATTCTGGAATCCTCCCCACATTCTAGAATTCTCCCAACATTTTTTTTTCT 
TGAGACAGAGTCTTGCTCTGTTGCCCAGGCTAGAGTGCAGTGGTGCAATCTCAGTTCACTGC 
AACCTCTGCCTCCCGGGTTCAAGCGATTCTTCTGCCTCAGCCTCCCTAGTGGCTGGGATTAC 
AGGCGCCTGCTACCATGCCTGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATA 
TTGGCCAGGCTGGTCTTGAACTCCTGACTTCAGGTGACCCACCCGCCTCGGCCTCTCAAAAT 
GCTGGGATTACAGGTGTGAGCCACCGTGCCTGGCCAATTCCAACATTCTTAAATTCTCTCAT 
CCCTCCAGGGCTCCCCGTGCTATGTTCTCTTTACCCCTTCCCCCTCTTCTCTTGCTCAGGCC 
TG CA C CACTG C AG CCAC CGTTCATTTATT C ATT C ATT AAAC ACTG AG CACT CACT CT GTG CT 
GGGTCCCGGGAAGGGTGAGGGGGTCAGACACAGGCCCTGCCCCTGCCCTCAGTGACTGGCCA 
G T C C AG C C CAGGCGGGGAGAGATGTGTACATAGG TTTT AAAG CAGACC CAG AGC TCATGGGG 
GCCTGTGTTCTGGGTGTTCAGGTGCTGCTGGTCCTCCATTACCCACTGCTCCCCAAGGCTGG 
TGGGACGGGGTCCCGGTGGCAGGGGCAGGTATCTCCTTCCCGTTCCTCATCCACCTGCCCAG 
TGCTCATCGTTACAGCAAACCCCAGGGGGCCTTGGCCAGGTCAAGGGTTCTGTGAGGAGAGG 
ACCCAGGAGTGTGGGGGCATTTGGGGGGTGAAGTGGCCCCCGAAGAATGGAACCCACACCCA 
TAGCTCTCCCCACAGCTGATACGGCATCCTGCGAGAAGACCTGCCCTCCTCACTGGGATCCC 
CTTCCTGCCTCCTCCCAGGGCTCTGCCAGGGCCTTGCTCAGTCCCTTCCACCAAAGTCATCT 
GAACTTCCGTTTCCCCAGGGCCTCCAGCTGCCCTCAGACACTGATGTCTGTCCCCAGGTGCT 
CTCTGCCCCTCATGCCCCTCTCACCGGCCCAGTGCCCCGACTCTCCAGGCTTTATCAAGGTG 
CTAAGGCCCGGGTGGGCAGCTCCTCGTCTCAGAGCCCTCCTCCGGCCTGGTGCTGCCTTTAC 
AAACACCTGCAGGAGAAGGGCCACGGAAGCCCCAGGCTTTAGAGCCCTCAGCAGGTCTGGGG 
AGCTAGAGCAAAGGAGGGACCTCAGGCCTTCCGTTTCTTCTTCCAGGGTGGGGTGGCCTGGT 
GTTCCCCTAGCCTTCCAAACCCAGGTGGCCTGCCCTTCTCCCCAGAGGGAGGCGGCCTCCGC 
CCATTGGTGCTCATGCAGACTCTGGGGCTGAGGTGCCCCGGGGGGTGATCTCTGGTGCTCAC 
AGCCGAGGGAGCCGTGGCTCCATGGCCAGATGACGGAAACAGGGTCTGACCAAGTGCCAGGA 
AGACCTGTGCTATAAACCACCCTGCCTGATCCTGCCCCTGCCTGACCCCGCCACGCCCTGCC 
GTCCAGCATGATTAAAGAATGCTGTCTCCTCTTGGAAAAAAAAAAAAAAAA 
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FIGURE 30 

MVTAALGFVWAALLLFLLMCEIRMVELTFDRAVASGCQRCCDSEDPLDPAHVSSASSSGRPH 
ALPEIRPYINITILKGDKGDPGPMGLPGYMGREGPQGEPGPQGSKGDKGEMGSPGAPCQKRF 
FAFSVGRKTALHSGEDFQTLLFERVFVNLDGCFDl^TGQFAAPLRGIYFFSLNVHSWNYKET 
YVHIMHNQKEAVILYAQPSERSIMQSQSVMLDLAYGDRVWVRLFKRQRENAIYSNDFDTYIT 
FSGHLIKAEDD 



Important features : 
Signal peptide: 

amino acids 1-20 



N-glycosylation site. 

amino acids 72-75 

Clq domain proteins . 

amino acids 144-178, 78-111 and 84-117 
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FIGURE 31 

ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTG 

AGGCCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTG 

CTCCAGGGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGG 

GCTGCCCATCCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGG 

ACCACGGTGCCCCGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCAT 

CACCATGCTCGACGCAGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCAC 

AGAACC^GATCGCCAGCCTGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTG 

GACCTGACGGCCAACAGGCTGCATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCT 

CGAGCGCCTCTACCTGGGCAAGAACCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGC 

TCGACCGCCTCCTGGAGCTCAAGCTGCAGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGC 

CTGCCCCGCCTGCTGCTGCTGGACCTCAGCCACAACAGCCTCCTGGCCCTGGAGCCCGGCAT 

CCTGGACACTGCCAACGTGGAGGCGCTGCGGCTGGCTGGTCTGGGGCTGCAGCAGCTGGACG 

AGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACCTGGATGTGTCCGACAACCAGCTGGAG 

CGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACGCGCCTGCGGCTGGCCGGCAACAC 

CCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGCTGCCCTGCAGGAGCTGGATG 

TGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCCTCTTCCCCCGCCTGCGG 

CTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGCTGGTTTGGCCCCTG 

GGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCACTTCCCGCCCA 

AGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAGCCACCACC 

ACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCTTCTAG 

CTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTCCA 

CTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTC 

AATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTT 

CACGGGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCA 

CGCCGAGGCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGC 

GTGGGGCTGCAGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTA 

TCGCAACCTATCGGGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTG 

AGTACACGGTCACCCAGCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGG 

CCCGGGCGGGTGCCGGAGGGCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCA 

CTCCAACCACGCCCCAGTCACCCAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCG 

CCCTGGCCGCGGTGCTCCTGGCCGCGCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGG 

GGGCGGGCCATGGCAGCAGCGGCTCAGGACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCT 

GGAACTGGAGGGAGTGAAGGTCCCCTTGGAGCCAGGCCCGAAGGCAACAGAGGGCGGTGGAG 

AGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCCACTCATGGGCTTCCCAGGGCCTGGCCTC 

CAGTCACCCCTCCACGCAAAGCCCTACATCIAAGCCAGAGAGAGACAGGGCAGCTGGGGCCG 

GGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCCACACCACGTAAGTTCTCAGTCC 

CAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCTGGGCCCTGTTCCCTCTGGA 

CCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCCCTAACGTCCCCAGAAC 

CGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTCCCTGGGCACGGCG 

GGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCACTCCAGGCGGA 

CCCTGGGGGC CAGT GAAGGAAG CT C CCGGAAAGAG CAG AGGG AG AG CGGGT AGG CGG CTGTG 

TGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGCTTTA 

GG AACATGTTTTGCTTTTTTAAAATATATATATTTATAAGAGATCCTTTCCCATTTATTCTG 

GG AAGATGTTTTTCAAACTCAGAGACAAGGACTTTGGTTTTTGTAAGACAAACGATGATATG 

AAGGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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FIGURE 32 

MCSRVPLLLPLLLLLALC-PGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFEN 
GITMLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLR 
RLERLYLGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEP 

G I LDT ANVEALRLAGLGLQQ LD EGLF SRLRNLHDLDVSDNQLERVP P V I RGLRGLTRLRLAG 
m^IAQLRPEDLAGIJVALQELDVSNLSLQALPGDLSGLFPRLRLIAAARNPFNCVCPLSWFG 
PWWESHVTLASPEETRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPVVREPTALS 
SSLAPTWLSPTAPATEAPSPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPE 
GFTGLYCESQMGQGTRPSPTPVTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRL 
TYRNLSGPDKRLVTLRLPASLAEYTVTQL^ 

VHSNHAPVTQAREGNLPLL I AP ALAAVIjLAALAAVGAA YCVRRGRAMAAAAQDKGQ VG PG AG 
PLELEGVKVPLEPGPKATEGGGEALPSGSECEVPLMGFPGPGLQSPLHAKPYI 
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FIGURE 33 

GAATCATCCACGCACCTGCAGCTCTGCTGAGAGAGTGCAAGCCGTGGGGGTTTTGAGCTCAT 
CTTCATCATTCATATGAGGAAATAAGTGGTAAAATCCTTGGAAATACAATGAGACTCATCAG 
AAACATTTACATATTTTGTAGTATTGTTATGACAGCAGAGGGTGATGCTCCAGAGCTGCCAG 
AAGAAAGGGAACTGATGACCAACTGCTCCAACATGTCTCTAAGAAAGGTTCCCGCAGACTTG 
ACCCCAGCCACAACGAC^CTGGATTTATCCTATAACCTCCTTTTTCAACTCCAGAGTTCAGA 
TTTTCATTCTGTCTCCAAACTGAGAGTTTTGATTCTATGCCATAACAGAATTCAACAGCTGG 
ATCTCAAAACCTTTGAATTCAACAAGGAGTTAAGATATTTAGATTTGTCTAATAACAGACTG 
AAGAGTGTAACTTGGTATTTACTGGCAGGTCTCAGGTATTTAGATCTTTCTTTTAATGACTT 
TGACACCATGCCTATCTGTGAGGAAGCTGGCAACATGTCACACCTGGAAATCCTAGGTTTGA 
GTGGGGCAAAAATACAAAAATCAGATTTCCAGAAAATTGCTCATCTGCATCTAAATACTGTC 
TTCTTAGGATTCAGAACTCTTCCTCATTATGAAGAAGGTAGCCTGCCCATCTTAAACACAAC 
AAAACTGCACATTGTTTTACCAATGGACACAAATTTCTGGGTTCTTTTGCGTGATGGAATCA 
AGACTTCAAAAATATTAGAAATGACAAATATAGATGGCAAAAGCCAATTTGTAAGTTATGAA 
ATGCAACGAAATCTTAGTTTAGAAAATGCTAAGACATCGGTTCTATTGCTTAATAAAGTTGA 
TTTACTCTGGGACGACCTTTTCCTTATCTTACAATTTGTTTGGCATACATCAGTGGAACACT 
TTCAGATCCGAAATGTGACTTTTGGTGGTAAGGCTTATCTTGACCACAATTCATTTGACTAC 
TCAAATACTGTAATGAGAACTATAAAATTGGAGCATGTACATTTCAGAGTGTTTTACATTCA 
ACAGGATAAAATCTATTTGCTTTTGACCAAAATGGACATAGAAAACCTGACAATATCAAATG 
CAC^AATGCCACACATGCTTTTCCCGAATTATCCTACGAAATTCCAATATTTAAATTTTGCC 
AATAATATCTTAACAGACGAGTTGTTTAAAAGAACTATCCAACTGCCTCACTTGAAAACTCT 
CATTTTGAATGGCAATAAACTGGAGACACTTTCTTTAGTAAGTTGCTTTGCTAACAACACAC 
CCTTGGAACACTTGGATCTGAGTCAAAATCTATTACAACATAAAAATGATGAAAATTGCTCA 
TGGCCAGAAACTGTGGTCAATATGAATCTGTCATACAATAAATTGTCTGATTCTGTCTTCAG 
GTGCTTGCCCAAAAGTATTCAAATACTTGACCTAAATAATAACCAAATCCAAACTGTACCTA 
AAGAGACTATTCATCTGATGGC CTTACGAGAA CT AAATATTG CATTTAATTTTCTAACTGAT 
CTCCCTGGATGCAGTCATTTCAGTAGACTTTCAGTTCTGAACATTGAAATGAACTTCATTCT 
CAGCCCATCTCTGGATTTTGTTCAGAGCTGCCAGGAAGTTAAAACTCTAAATGCGGGAAGAA 
ATCCATTCCGGTGTACCTGTGAATTAAAAAATTTCATTCAGCTTGAAACATATTCAGAGGTC 
ATGATGGTTGGATGGTCAGATTCATACACCTGTGAATACCCTTTAAACCTAAGGGGAACTAG 
GTTAAAAGACGTTCATCTCCACGAATTATCTTGCAACACAGCTCTGTTGATTGTCACCATTG 
TGGTTATTATGCTAGTTCTGGGGTTGGCTGTGGCCTTCTGCTGTCTCCACTTTGATCTGCCC 
TGGTATCTCAGGATGCTAGGTCAATGCACACAAACATGGCACAGGGTTAGGAAAACAACCCA 
AG AACAACT CAAGAGAAATGTC CG ATT C CACGC ATTT ATTTCATACAGTG AACATG ATTCT C 
TGTGGGTGAAGAATGAATTGATCCCCAATCTAGAGAAGGAAGATGGTTCTATCTTGATTTGC 
CTTTATGAAAGCTACTTTGACCCTGGCAAAAGCATTAGTGAAAATATTGTAAGCTTCATTGA 
GAAAAGCTATAAGTCCATCTTTGTTTTGTCTCCCAACTTTGTCCAGAATGAGTGGTGCCATT 
ATGAATTCTACTTTGCCCACCACAATCTCTTCCATGAAAATTCTGATCATATAATTCTTATC 
TTACTGGAACCCATTCCATTCTATTGCATTCCCACCAGGTATCATAAACTGAAAGCTCTCCT 
GGAAAAAAAAGCATACTTGGAATGGCCCAAGGATAGGCGTAAATGTGGGCTTTTCTGGGCAA 
ACCTTCGAGCTGCTATTAATGTTAATGTATTAGCCACCAGAGAAATGTATGAACTGCAGACA 
TTCACAGAGTTAAATGAAGAGT CT CGAGGTTCTACAATCTCTCTGATGAGAACAGATTGTCT 
ATAAAATCCCACAGTCCTTGGGAAGTTGGGGACCACATACACTGTTGGGATGTACATTGATA 
CAACCTTTATGATGGCAATTTGACAATATTTATTAAAATAAAAAATGGTTATTCCCTTCATA 
TCAGTTTCTAGAAGGATTTCTAAGAATGTATCCTATAGAAACACCTTCACAAGTTTATAAGG 
GCTTATGGAAAAAGGTGTTCATCCCAGGATTGTTTATAATCATGAAAAATGTGGCCAGGTGC 
AGTGGCTCACTCTTGTAATCCCAGCACTATGGGAGGCCAAGGTGGGTGACCCACGAGGTCAA 
GAGATGGAGACCATCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTA 
GCTGGGCGTGATGGTGCACGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCG 
CTTGAACCCGGGAGGTGGCAGTTGCAGTGAGCTGAGATCGAGCCACTGCACTCCAGCCTGGT 
GACAG AG C GAGA CTC CAT CT CAAAAAAAAGAAAAAAAAAAAAG AAAAAAATGGAAAACATCC 
T C ATGG C CAC AAAATAAGGT CTAATT CAAT AAATTATAGTAC ATT AATGT AATATAATATTA 
CATGCCACTAAAAAGAATAAGGTAGCTGTATATTTCCTGGTATGGAAAAAACATATTAATAT 
GTTATAAACTATTAGGTTGGTGCAAAACTAATTGTGGTTTTTGCCATTGAAATGGCATTGAA 
ATAAAAGTGTAAAGAAATCTATACCAGATGTAGTAACAGTGGTTTGGGTCTGGGAGGTTGGA 
TT ACAGGG AG CATTTG ATTT CTATGTTGTGTATTTCTATAATGTTTGAATTGTTTAGAATGA 
ATCTGTATTTCTTTTATAAGTAGAAAAAAAATAAAGATAGTTTTTACAGCCT 
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FIGURE 34 

MRLIRNIYIFC5IVNTAEGDAPELPEERELMTNCSNMSLRKVPADLTPATTTLDLSYNLLFQ 
LQSSDFHSVSKLRVLILCHNRIQQLDLKTFEFNKELRYLDLSNNRLKSVTWYLLAGLRYLDL 
SFNDFDTMPICEEAGNMSHLEIIX3LSGAKIQKSDFQKIAHLHLNTVFLGFRTLPHYEEGSLP 
II^TKIillVLPMDTNFWVLLRDGIKTSKILEMTNIDGKSQFVSYEMORNLSLENAKTSVLL 
LNKVDLLWDDLFLILQFVWHTSVEH^ 

VF Y I QQDK I YLLLTKMD I ENLT I SNAQMPHMLFPNYPTKFQYLNFANNI LTDELFKRTIQLP 
HLKTLII^GNKLETLSLVSCFANNTPLEHLDLSQN^^ 

DSVFRCLPKSIQILDLNNNQIQTVPKETIHLMALRELNIAFNFLTDLPGCSHFSRLSVLNIE 
MNF I LS PSLDFVQS CQEVKTLNAGRNPFRCTCELKNF I QLETYS EVMMVGWSDS YTCE YPLN 
LRGTRLKDVHLHELSCTJTALLIVTIWI^ 

RKTTQEQLKRNVRFHAFISYSEHDSLWVKNELIPNLEKEDGSILICLYESYFDPGKSISENI 

VSFIEKSYKSIFVLSPNFVQNEWCHYEFYFAHHNLFHENSDHIILILLEPIPFYCIPTRYHK 

LKALLEKKAYLEWPKDRRKCGLFWANLRAAINVNVIJVTREMYELQTFTELNEESRGSTISLM 
RTDCL 
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FIGURE 35 A 

GGGGGCTTTCTTGGGCTTGGCTGCTTGGAACACCTGCCTCCAAGGACCGGCCTCGGAGGGGT 
CGCCGGGAAAGGGAGGGAAGAAGGAAGGGCGGGGCCGGCCCCCCTGCGCCCGCCCCGCGCCT 
CTGCGCGCCCCTGTCCGCCCCGGCCCAGCCCAGCCCAGCCCCGCGGGCCGGTCACACGCGCA 
GCCAGCCGGCCGCCTCCCGCGCCCAAGCGCGCCGCTCTGCTGTGCCCTGCGCCCTTGCCCCG 
CGCCAGCTTCTGCGCCCGCAGCCCGCCCGGCGCCCCCGGTGACCGTGACCCTGCCCTGGGCG 
CGGGGCGGAGCAGGCATGTCCCGCCCGGGGACCGCTACCCCAGCGCTGGCCCTGGTGCTCCT 
GGCAGTGACCCTGGCCGGGGTCGGAGCCCAGGGCGCAGCCCTCGAGGACCCTGATTATTACG 
GGCAGGAGATCTGGAGCCGGGAGCCCTACTACGCGCGCCCGGAGCCCGAGCTCGAGACCTTC 
TCTCCGCCGCTGCCTGCGGGGCCCGGGGAGGAGTGGGAGCGGCGCCCGCAGGAGCCCAGGCC 
GCCCAAGAGGGCCACCAAGCCCAAGAAAGCTCCCAAGAGGGAGAAGTCGGCTCCGGAGCCGC 
CTCCACCAGGTAAACACAGCAACAAAAAAGTTATGAGAACCAAGAGCTCTGAGAAGGCTGCC 
MCGATGATCACAGTGTCCGTGTGGCCCGTGAAGATGTCAGAGAGAGTTGCCCACCTCTTGG 
TCTGGAAACCTTAAAAATCACAGACTTCCAGCTCCATGCCTCCACGGTGAAGCGCTATGGCC 
TGGGGGCACATCGAGGGAGACTCAACATCCAGGCGGGCATTAATGAAAATGATTTTTATGAC 
GGAGCGTGGTGCGCGGGAAGAAATGACCTCCAGCAGTGGATTGAAGTGGATGCTCGGCGCCT 
GACCAGATTCACTGGTGTCATCACTCAAGGGAGGAACTCCCTCTGGCTGAGTGACTGGGTGA 
CATCCTATAAGGTCATGGTGAGCAATGACAGCCACACGTGGGTCACTGTTAAGAATGGATCT 
GGAGACATGATATTTGAGGGAAACAGTGAGAAGGAGATCCCTGTTCTCAATGAGCTACCCGT 
CCCCATGGTGGCCCGCTACATCCGCATAAACCCTCAGTCCTGGTTTGATAATGGGAGCATCT 
GCATGAGAATGGAGATCCTGGG CTGCCC ACTG CC AGAT CCTAAT AATT ATTATC ACCGCCGG 
AACGAGATGACCACCACTGATGACCTGGATTTTAAGCACCACAATTATAAGGAAATGCGCCA 
GTTGATGAAAGTTGTGAATGAAATGTGTCCCAATATCACCAGAATTTACAACATTGGAAAAA 
GCCACCAGGGCCTGAAGCTGTATGCTGTGGAGATCTCAGATCACCCTGGGGAGCATGAAGTC 
GGTGAGCCCGAGTTCCACTACATCGCGGGGGCCCACGGCAATGAGGTGCTGGGCCGGGAGCT 
GCTGCTGCTGCTGGTGCAGTTCGTGTGTCAGGAGTACTTGGCCCGGAATGCGCGCATCGTCC 
ACCTGGTGGAGGAGACGCGGATTCACGTCCTCCCCTCCCTCAACCCCGATGGCTACGAGAAG 
GCCTACGAAGGGGGCTCGGAGCTGGGAGGCTGGTCCCTGGGACGCTGGACCCACGATGGAAT 
TGACATCAACAACAACTTTCCTGATTTAAACACGCTGCTCTGGGAGGCAGAGGATCGACAGA 
ATGTCCCCAGGAAAGTTCCCAATCACTATATTGCAATCCCTGAGTGGTTTCTGTCGGAAAAT 
GC CACGGTGGCTGCCGAGAC CAGAGCAGTCATAGCCTGGATGGAAAAAAT CCCTTTTGTG CT 
GGGCGGCAACCTGCAGGGCGGCGAGCTGGTGGTGGCGTATCCCTACGACCTGGTGCGGTCCC 
CCTGGAAGACGCAGGAACACACCCCCACCCCCGATGACCACGTGTTCCGCTGGCTGGCCTAC 
TCCTATGCCTCCACACACCGCCTCATGACAGACGCCCGGAGGAGGGTGTGCCACACGGAGGA 
CTTCCAGAAGGAGGAGGGCACTGTCAATGGGGCCTCCTGGCACACCGTCGCTGGAAGTCTGA 
ACGATTTCAGCTACCTTCATACAAACTGCTTCGAACTGTCCATCTACGTGGGCTGTGATAAA 
TACCCACATGAGAGCCAGCTGCCCGAGGAGTGGGAGAATAACCGGGAATCTCTGATCGTGTT 
CATGGAGCAGGTTCATCGTGGCATTAAAGGCTTGGTGAGAGATTCACATGGAAAAGGAATCC 
CAAACGCCATTATCTCCGTAGAAGGCATTAACCATGACATCCGAACAGCCAACGATGGGGAT 
TACTGGCGCCTCCTGAACCCTGGAGAGTATGTGGTCACAGCAAAGGCCGAAGGTTTCACTGC 
ATCCACCAAGAACTGTATGGTTGGCTATGACATGGGGGCCACAAGGTGTGACTTCACACTTA 
GCAAAACCAACATGGCCAGGATCCGAGAGATCATGGAGAAGTTTGGGAAGCAGCCCGTCAGC 
CTGCCAGCCAGGCGGCTGAAGCTGCGGGGGCGGAAGAGACGACAGCGTGGGTGACCCTCCTG 
GGCCCTTGAGACTCGTCTGGGACCCATGCAAATTAAACCAACCTGGTAGTAGCTCCATAGTG 
GACTCACTCACTGTTGTTTCCTCTGTAATTCAAGAAGTGCCTGGAAGAGAGGGTGCATTGTG 
AGGCAGGTCCCAAAAGGGAAGGCTGGAGGCTGAGGCTGTTTTCTTTTCTTTGTTCCCATTTA 
TCCAAATAACTTGGACAGAGCAGCAGAGAAAAGCTGATGGGAGTGAGAGAACTCAGCAAGCC 
AACOTGGGAATCAGAGAGAGAAGGAGAAGGAGGGGAGCCTGTCCGTTCAGAGCCTCTGGCTGC 
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ATAGAAAAGGATTCTGGTGCTTCCCCTGTTTGCGTGGCAGCAAGGGTTCCACGTGCATTTGC 

AATTTGCACAGCTAAAATTGCAGCATTTCCCCAGCTGGGCTGTCCCAAATGTTACCATTTGA 

GATGCTCCCAGGCGTCCTAAGAGAATCCACCCTCTCTGGCCCTGGGACATTGCAAGCTGCTA 

CAAATAAATTCTGTGTTCTTTTGACAATAGCGTCATTGCCAAGTGCACATCAGTGAGCCTCT 

TGAATCTGTTTAGTCTCCTTTTTCAACAAAGGAGTGTGTTCAGAAAAGGAGAGAGAGGCTGA 

GATCATTCAGGAGTTTGTTGGGCAGCAAGCATGGAGCTTCTTGCACAAATTCTGGGTCCATA 

AACAACCCCCAAAGTCCCTGCTGATCCAGTAGCCCTGGAGGTTCCCCAGGTAGGGAGAGCCA 

GAGGTGCCAGCCTTCCTGAAGGGCCAGAAAATTTAGCCTGGATCTCCTCTTTTACCTGCTAG 

GACTGGAAAGAGCCAGAAGTGGGGTGGCCTGAAGCCCTCTCTCTGCTTGAGGTATTGCCCCT 

GTGTGGAATTGAGTGCTCATGGGTTGGCCTCATATCAGCCTGGGAGTTATTTTTGATATGTA 

GAATGCCAGATCTTCCAGATTAGGCTAAATGTAATGAAAACCTCTTAGGATTATCTGTGGAG 

CATCAGTTTGGGAAGAATTATTGAATTATCTTGCAAGAAAAAAGTATGTCTCACTTTTTGTT 

AATGTTGCTGCCTCATTGACCTGGGAAAAATGAAAAAAAAAAATAAAGCAAATGGTAAGACC 

CTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 36 

MSRPGTATPALALVLI^VTLAGVGAQGAALEDPDYYGQEIWSREPYYARPEPELETFSPPLP 
AGPGEEWERRPQEPRPPKRATKPKKAPKREKSAPEPPPPGKHSNKKVMRTKSSEKAANDDHS 
VRVAREDVRESCPPLGLETLKITDFQLHASTVKRYGLGAHRGRLNIQAGINENDFYDGAWCA 
GRNDLQQWIEVDARRLTRFTGVITQGRNSLWLSDWTSYKVMVSNDSHTVAm^GSGDMIF 
EGNSE KE I PVLNELPVPMVARY I R I NPQSWFDNGS I CMRME I LGCPLPDPNNYYHRRNEMTT 
TDDLDFKHHNYKEMRQIiMKVVNEMCPNI TR I YNI GKSHQGLKLYAVE I SDHPGEHEVGEPEF 
HY I AGAHGNEVLGRELLLLLVQ FVCQE YI^RNAR I VHLVEETRIHVLPSLNPDG YEKAYEGG 
SELGGWSLGRWTHDGIDINNNFPDLOTLLWEAEDRQNVPRKVPNHYIAIPEWFLSENATVAA 
ETRAVIAWMEKIPFVLGGNLQGGELWAYPYDLVRSPWKTQEHTPTPDDHVFRWLAYSYAST 
HRLMTDARRRVCHTEDFQKEEGTVNGASWHTVAGSLNDFSYLHTNCFELSIYVGCDKYPHES 
CI.PEEWENNRESLIVFMEQVHRGIKGLVRDSHGKGIPNAIISVEGINHDIRTANDGDYWRLL 
NPGEYVVTAKAEGFTASTKNCMVGYDMGATRCDFTLSKTNMAR-IREIMEKFGKQPVSLPARR 
LKLRGRKRRQRG 
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FTGTJRE 37 

CTAAGAGGACAAGATGAGGCCCGGCCTCTCATTTCTCCTAGCCCTTCTGTTCTTCCTTGGCC 
AAGCTGCAGGGGATTTGGGGGATGTGGGACCTCCAATTCCCAGCCCCGGCTTCAGCTCTTTC 
CCAGGTGTTGACTCCAGCTCCAGCTTCAGCTCCAGCTCCAGGTCGGGCTCCAGCTCCAGCCG 
CAGCTTAGGCAGCGGAGGTTCTGTGTCCCAGTTGTTTTCCAATTTCACCGGCTCCGTGGATG 
ACCGTGGGACCTGCCAGTGCTCTGTTTCCCTGCCAGACACCACCTTTCCCGTGGACAGAGTG 
GAACGCTTGGAATTCACAGCTCATGTTCTTTCTCAGAAGTTTGAGAAAGAACTTTCTAAAGT 
GAGGGAATATGTCCAATTAATTAGTGTGTATGAAAAGAAACTGTTAAACCTAACTGTCCGAA 
TTGACATCATGGAGAAGGATACCATTTCTTACACTGAACTGGACTTCGAGCTGATCAAGGTA 
GAAGTGAAGGAGATGGAAAAACTGGTCATACAGCTGAAGGAGAGTTTTGGTGGAAGCTCAGA 
AATTGTTGACCAGCTGGAGGTGGAGATAAGAAATATGACTCTCTTGGTAGAGAAGCTTGAGA 
CACTAGACAAAAACAATGTCCTTGCCATTCGCCGAGAAATCGTGGCTCTGAAGACCAAGCTG 
AAAGAGTGTGAGGCCTCTAAAGATCAAAACACCCCTGTCGTCCACCCTCCTCCCACTCCAGG 
GAGCTGTGGTCATGGTGGTGTGGTGAACATCAGCAAACCGTCTGTGGTTCAGCTCAACTGGA 
GAGGGTTTTCTTATCTATATGGTGCTTGGGGTAGGGATTACTCTCCCCAGCATCCAAACAAA 
GGACTGTATTGGGTGGCGCCATTGAATACAGATGGGAGACTGTTGGAGTATTATAGACTGTA 
CAACACACTGGATGATTTGCTATTGTATATAAATGCTCGAGAGTTGCGGATCACCTATGGCC 
AAGGTAGTGGTACAGCAGTTTACAACAACAACATGTACGTCAACATGTACAACACCGGGAAT 
ATTGCCAGAGTTAACCTGACCACCAACACGATTGCTGTGACTCAAACTCTCCCTAATGCTGC 
CTATAATAACCGCTTTTCATATGCTAATGTTGCTTGGCAAGATATTGACTTTGCTGTGGATG 
AGAATGGATTGTGGGTTATTTATTCAACTGAAGCCAGCACTGGTAACATGGTGATTAGTAAA 
CTCAATGACACCACACTTCAGGTGCTAAACACTTGGTATACCAAGCAGTATAAACCATCTGC 
TTCTAACGCCTTCATGGTATGTGGGGTTCTGTATGCCACCCGTACTATGAACACCAGAACAG 
AAGAGATTTTTTACTATTATGACACAAACACAGGGAAAGAGGGCAAACTAGACATTGTAATG 
CATAAGATGCAGGAAAAAGTGCAGAGCATTAACTATAACCCTTTTGACCAGAAACTTTATGT 
CTATAACGATGGTTACCTTCTGAATTATGATCTTTCTGTCTTGCAGAAGCCCCAGTA^GCTG 
TTTAGGAGTTAGGGTGAAAGAGAAAATGTTTGTTGAAAAAATAGTCTTCTCCACTTACTTAG 
ATATCTGCAGGGGTGTCTAAAAGTGTGTTCATTTTGCAGCAATGTTTAGGTGCATAGTTCTA 
CCACACTAGAGATCTAGGACATTTGTCTTGATTTGGTGAGTTCTCTTGGGAATCATCTGCCT 
CTTCAGGCGCATTTTGCAATAAAGTCTGTCTAGGGTGGGATTGTCAGAGGTCTAGGGGCACT 
GTGGGCCTAGTGAAGCCTACTGTGAGGAGGCTTCACTAGAAGCCTTAAATTAGGAATTAAGG 
AACTTAAAACTCAGTATGGCGTCTAGGGATTCTTTGTACAGGAAATATTGCCCAATGACTAG 
TCCTCATCCATGTAGCACCACTAATTCTTCCATGCCTGGAAGAAACCTGGGGACTTAGTTAG 
GTAGATTAATATCTGGAGCTCCTCGAGGGACCAAATCTCCAACTTTTTTTTCCCCTCACTAG 
CACCTGGAATGATGCTTTGTATGTGGCAGATAAGTAAATTTGGCATGCTTATATATTCTACA 
TCTGTAAAGTGCTGAGTTTTATGGAGAGAGGCCTTTTTATGCATTAAATTGTACATGGCAAA 
TAAATCCCAGAAGGATCTGTAGATGAGGCACCTGCTTTTTCTTTTCTCTCATTGTCCACCTT 
ACTAAAAGTCAGTAGAATCTTCTACCTCATAACTTCCTTCCAAAGGCAGCTCAGAAGATTAG 
AACCAGACTTACTAACCAATTCCACCCCCCACCAACCCCCTTCTACTGCCTACTTTAAAAAA 
ATTAATAGTTTTCTATGGAACTGATCTAAGATTAGAAAAATTAATTTTCTTTAATTTCATTA 
TGGACTTTTATTTACATGACTCTAAGACTATAAGAAAATCTGATGGCAGTGACAAAGTGCTA 
GCATTTATTGTTATCTAATAAAGACCTTGGAGCATATGTGCAACTTATGAGTGTATCAGTTG 
TTGCATGTAATTTTTGCCTTTGTTTAAGCCTGGAACTTGTAAGAAAATGAAAATTTAATTTT 
TTTTTCTAGGACGAGCTATAGAAAAGCTATTGAGAGTATCTAGTTAATCAGTGCAGTAGTTG 
GAAACCTTGCTGGTGTATGTGATGTGCTTCTGTGCTTTTGAATGACTTTATCATCTAGTCTT 
TGTCTATTTTTCCTTTGATGTTCAAGTCCTAGTCTATAGGATTGGCAGTTTAAATGCTTTAC 
T C C C C CTTTTAAAATAAATGATTAAAATGTG CTT TGAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 

MRPGLSFLLALLFFLGQAAGDLGDVGPPIPSPGFSSFPGVDSSSSFSSSSRSGSSSSRSLGS 

GGSVSQLFSNFTGSVDDRGTCQCSVSLPDTTFPVDRVERLEFTAHVLSQKFEKELSKVREYV 

QLISVYEKKLLNLTVRIDIMEKDTISYTELDFELIKVEVKEMEKLVIQLKESFGGSSEIVDQ 

LETVEIRI^TLLVEKLETLDKNNVLAIRREIVALKTKLKECEASKDQNTPVVH 

GGVVNISKPSWQLNWRGFSYLYGAWGRDYSPQHPNKGLYWVAPLOTDGRLLEYYRLYNTLD 

DLLLYINARELRITYGQGSGTAVYNNNMYVNMYNTGNIARVNLTTNTIAOT 

FSYANVAWQDIDFAVDENGLWIYSTEASTGNMVISKLNDTTLQVLNTWYTKQYKPSASNAF 

MVCGVLYATRTMNTRTEE I FYYYDTNTGKEGKLDI VMHKMQEKVQS INYNPFDQKLYVYNDG 

YLLNYDLSVLQKPQ 
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FIGURE 39 

GCTCTGAAGACCAAGCTGAAAGAGTGTGAGGCCTCTAAAGATCAAACACCCCTGTCGTCCAC 
CCTCCTCCCACTCCAGGGAGCTGTGGTCATGGTGGTGTGGTGAACATCAGCAAACCGTCTGT 
GGTTCAGCTCAACTGGAGAGGGTTTTCTTATCTATATGGTGCTTGGGGTAGGGATTACTCTC 
CCCAGCATCCAAACAAAGGNATGTATTGGGNGGCGCCATTGAATACAGATGGGAGACTGTTG 
GAGTATTATAGACTGTACAACCCACTGGATGATTTGCTATTGTATATAAATGCTCGAGAGTT 
GCGGATCACCTATGGCCAAGGTAGTGGTACAGCAGTTTACAACAACAACATGTACGTCAACA 
TGTACAACACCGGGNATATTGCCAGAGTTAACCTGACC 
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FIGURE 40 

TCTCGCAGATAGTAAATAATCTCGGAAAGGCGAGAAAGAAGCTGTCTCCATCTTGTCTGTAT 
CCGCTGCTCTTGTGACGTTGTGGAGATGGGGAGCGTCCTGGGGCTGTGCTCCATGGCGAGCT 
GGATACCATGTTTGTGTGGAAGTGCCCCGTGTTTGCTATGCCGATGCTGTCCTAGTGGAAAC 
AACTCCACTGTAACTAGATTGATCTATGCACTTTTCTTGCTTGTTGGAGTATGTGTAGCTTG 
TGTAATGTTGATACCAGGAATGGAAGAACAACTGAATAAGATTCCTGGATTTTGTGAGAATG 
AGAAAGGTGTTGTCCCTTGTAACATTTTGGTTGGCTATAAAGCTGTATATCGTTTGTGCTTT 
GGTTTGGCTATGTTCTATCTTCTTCTCTCTTTACTAATGATCAAAGTGAAGAGTAGCAGTGA 
TCCTAGAGCTGCAGTGCACAATGGATTTTGGTTCTTTAAATTTGCTGCAGCAATTGCAATTA 
TTATTGGGGCATTCTTCATTCCAGAAGGAACTTTTACAACTGTGTGGTTTTATGTAGGCATG 
GCAGGTGCCTTTTGTTTCATCCTCATACAACTAGTCTTACTTATTGATTTTGCACATTCATG 
GAATGAATCGTGGGTTGAAAAAATGGAAGAAGGGAACTCGAGATGTTGGTATGCAGCCTTGT 
TATCAGCTACAGCTCTGAATTATCTGCTGTCTTTAGTTGCTATCGTCCTGTTCTTTGTCTAC 
TAC^CTttTCCAGCCAGTTGTTCAGAAAACAAGGCGTTCATCAGTGTCAACATGCTCCTCTG 
CGTTGGTGCTTCTGTAATGTCTATACTGCCAAAAATCCAAGAATCACAACCAAGATCTGGTT 
TGTTACAGTCTTCAGTAATTACAGTCTACACAATGTATTTGACATGGTCAGCTATGACCAAT 
GAACCAGAAACAAATTGCAACCCAAGTCTACTAAGCATAATTGGCTACAATACAACAAGCAC 
TGTC CCAAAGGAAGGG CAGT CAGT CCAGTGGTGG CATG CT CAAGGAAT TATAGGACTAATTC 
TCTTTTTGTTGTGTGTATTTTATTCCAGCATCCGTACTTCAAAC!AATAGTCAGGTTAATAAA 
CTGACTCTAACAAGTGATGAATCTACATTAATAGAAGATGGTGGAGCTAGAAGTGATGGATC 
ACTGGAGGATGGGGACGATGTTCACCGAGCTGTAGATAATGAAAGGGATGGTGTCACTTACA 
GTTATTCCTTCTTTCACTTCATGCTTTTCCTGGCTTCACTTTATATCATGATGACCCTTACC 
AA CTGGT C CAGGT ATGAA CC CT CT CGTG AG ATGAAAAGT CAGTGGACAGCTGTCTGGGTGAA 
AATCTCTTCCAGTTGGATTGGCATCGTGCTGTATGTTTGGACACTCGTGGCACCACTTGTTC 
TTACAAATCGTGATTTTGACT^GTGAGACTTCTAGCATGAAAGTCCCACTTTGATTATTGC 
TTATTTGAAAACAGTATTCCCAACTTTTGTAAAGTTGTGTATGTTTTTGCTTCCCATGTAAC 
TTCTCCAGTGTTCTGGC^TGAATTAGATTTTACTGCTTGTCATTTTGTTATTTTCTTACCAA 
GTGCATTGATATGTGAAGTAGAATGAATTGCAGAGGAAAGTTTTATGAATATGGTGATGAGT 
TAGTAAAAGTGGCCATTATTGGGCTTATTCTCTGCTCTATAGTTGTGAAATGAAGAGTAAAA 
ACAAATTTGTTTGACTATTTTAAAATTATATTAGACCTTAAGCTGTTTTAGCAAGCATTAAA 
GCAAATGTATGGCTGCCTTTTGAAATATTTGATG TGTTGC CTGG CAGGATACTG CAAAGAAC 
ATGGTTTATTTTAAAATTTATAAACAAGTCACTTAAATGCCAGTTGTCTGAAAAATCTTATA 
AGGTTITACCCTTGATACGGAATTTACACAGGTAGGGAGTGTTT^ 

TGGATGGAGGTGTCGGTACTAAATTGAATAACGAGTAAATAATCTTACTTGGGTAGAGATGG 
CCTTTGCCAACAAAGTGAACTGTTTTGGTTGTTTTAAACTCATGAAGTATGGGTTCAGTGGA 
AATGTTTGGAACTCTGAAGGATTTAGACAAGGTTTTGAAAAGGATAATCATGGGTTAGAAGG 
AAGTGTTTTGAAAGTCACTTTGAAAGTTAGTTTTGGGCCCAGCACGGTAGCTCACCCTTGGT 
AATCCCAGCACTTTGGGAGCTTAAGTGGGTAGATTACTTGAGCCCAGGAATTCAGACCAGCT 
TGGCACATGGTGAACCTGTTCTATAAAAATAATCTGGCTTTGAGCATATGCCTGTGGTCCAG 
CACTGAGAGGCTAGTGAAGATTGCTGAGCCCAGAGCCAAAGGTTGCAGTGAGCAAGTCACGT 
CACTGCACTCTAGCTGGCACAGAGTAAGCCAAAAAAATATATATATATTGAAATCAAGGAGG 
CAAAATTTTGACAGGGAAGGAAGTAACTGCAAAACCACTAGGCTTTAGTAGGTACTTATATA 
AAATCTAGTCCAGTTCTCTCATTTAAAAAAATGAAGACACTGAAATACAGACTTAAATAGCT 
C AGATAG CTAATTAGG AAATTT CAAGTT GG C C AATAAT AG CATT CT CT CTGACATTTAAAAA 
TAATTTCTATTCAAAATACATGCATATTGATTTACACCTCATACTGTGATAATTAATGTGAT 
GTGGATTGCTGGTGTCCAGCATGACCCATAAACAGGTCAGAAGAATGATGGAATGTTTTAGA 
ATAAACTCCTGCTTATAGTATACTACACAGTTCAAAAGATGTTTAAAATGCTTTTGTATTTA 
CTGCCATGTAATTGAAATATATAGATTATTGTAACCTTTCAACCTGAAAATCAAGCAGTATG 
AGAGTTTAGTTATTTGTATGTGTCACTAGTGTCTAATGAAGCTTTTAAAATCTACAATTTCT 
TCTTTAAAAATATTTATTAATGTGAATGGAATATAACAATTCAGCTTAATTCCCCAACCTTA 
TTCTGTGTGTAGACATTGTATTCCACAATTTTGAATGGCTGTGTTTTACCTCTAAATAAATG 
AATTCAGAGAAAAAAAAAAAAAAA 
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FIGURE 41 

MGSVLGLCSMASWIPCLCGSAPCLLCRCCPSGNNSTVTRLIYALFLLVGVCVACVMLIPGME 
EQLNKIPGFCENEKGWPCNILVGYKAVYRLCFGLAMFYLLLSLLMIKVKSSSDPRAAVHNG 
FWFFKFAAAI AI I IGAFF I PEGTFTTVWFYVGMAGAFCF I LI QLVLLI DFAHSWNESWVEKM 
EEGNSRCWYAALLSATALNYLLSLVAIVLFFVTC^ 

LPKIQESQPRSGLLQSSVITVYTMYLTWSAMTNEPETNCNPSLLSIIGYNTTSTVPKEGQSV 
QWWHAQGI IGLILFLLCVFYSSIRTSNNSQVNKLTLTSDESTLIEDGGARSDGSLEDGDDVH 
RAVDNERDGVTYSYSFFHFMLFLASLYIMMTLTNWSRYEPSREMKSQWTAVWVKISSSWIGI 
VL YVWTLVAPLVLTNRD FD 
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FIGURE 42 

GCGAGAAAGAAGCTGTCTCCATCTTGTCTGTATCCCGCTGCTTCTTGNGACGTTGTGGAGAT 
GGGGAGCGTCCCTGGGGCTGTGCTCCATGGCGAGCTGGATACCATGTTTGTGTGGAAGTGCC 
CCGTGTTTGCTATGCCGATGCTGTCCTAGTGGAAACAANTCCACTGTAACTAGATTGATCTA 
TGCACTTTTCTTGCTTGTTGGAGTATGTGTAGCTTGTGTAATGTTGATACCAGGAATGGAAG 
AACAACTGAATAAGATTCCTGGATTTTGTGAGAATGAGAAAGGTGTTGTCCCTTGTAACATT 
TTGGTTGGCTATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCTTCTTCT 
CTCTTTACTAATGATCAAAGTGAAGAGTAGCAGTGATCCTAGAGCTGCAGTGCACAATGGAT 
TTTGGTTCTTTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGC 
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FIGURE 43 

GTTATTGTGAACTTTGTGGAGATGGGAGGTCNTGGGGCTGTGTTCCATGGCGAGCTGGATAC 
CANGTTTGTGTGGAAGTGCCCCGTGTTTGNTATGCCGATGCTGTCCTAGTGGAAACAANTCC 
ACTGTAATTAGATTGATNTATGCACTTTTNTTGCTTGTTGGAGTANGTGTAGCTTGTGTAAT 
GTTGATACCAGGAATGGAAGAACAACTGAATAAGATTCCTGGATTTTGTGAGAATGAGAAAG 
GTGTTGTCCCTTGTAACATTTTGGTTGGCTATAAAGCTGTATATNGTTTGTGCTTTGGTTTG 
GCTANGTTCTATNTTCTTCTCTCTTTACTAATGATCAAAGTGAAGAGTAGCAGTGATCCTAG 
AGCTGCAGTGCACAATGGATTTTGGTTTTTTAAATTTGCTGCAGCAATTGCAATTATTATTG 
GGGC 
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FIGURE 44 

AAGAAGCTGTCTCCATCTTGTCTGTATCCGCTGCTCTTGTGAACGTTNTGGAGATGGGGAGC 
GTCCTTGGGGTTGTGCTCCATGGCGAGCTGGATACCATGTTTGTGTGGAAGTGCCCCGTGTT 
TGCTATGCCGATGCTGTCCTAGTGGAAACAACTCCACTGTAACTAGATTGATCTATGCACTT 
TTCTTGCTTGTTGGAGTATGTGTAGCTTGTGTAATGTTGATACCAGGAATGGAAGAACAACT 
GAATAAGATTCCTGGATTTTGTGAGAATGAGAAAGGTGTTGTCCCTTGTAACATTTTGGTTG 
GCTATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCTTCTTCTCTCTTTA 
CTAATGATCAAAGTGAAGAGTAGCAGTGATCCTAGAGCTGCAGTGCACAATGGATTTTGGTT 
CTTTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGC 
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FIGURE 45 

GCTGTCCTTAGTGGAAACAANTCCAACTTGTAACTTGGATTGATCTATGCACTTTTTCCTTG 
CTTGTTGGAGTATGTGTAGCTTTGTGTAATGTTGTTCCCAGGATTGGANGAACAACTGAATA 
AGATTCCTGGATTTTTGTGAGAATGAGAAAGGTGTTGTCCCCTTGTAACATTTTTGGTTGGC 
TATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCTTCTTCTCTCTTTACT 
AATGATCAAAGTGAAGAGTAGCAGTGATCCTAGAGCTGCAGTGCACAATGGATTTTGGTTCT 
TTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGCATTCTTCATTCCAGAAGGAACTTTT 
ACAACTGTGTGGTTTTATGTAGGCATGGCAGGTGCCTTTTGTTTCATCCTCATACAACTAGT 
CTTACTTATTGATTTTGCACATTCATGGAATGAATCGTGGGTTGAAAAAATGGAAGAAGGGA 
ACTCGAGATGTTGGTATGCAGCCTTGTTATCAGCTACAGCTCTGAATTATCTGCTGTCTTTA 
GTTGCTATCGTCCTGTTCTTTGTCTACTACACTCATCCAGCCAGTTGTTCAGAAAACAAGGC 
GTTCATCAGTGTCAACATGCTCCTCTGCGTTGGTGCTTCTGTAATG 
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FIGURE 46A 

CTCGGGCGCGCACAGGCAGCTCGGTTTGCCCTGCGATTGAGCTGCGGGTCGCGGCCGGCGCC 
GGCCTCTCCAATGGCAAATGTGTGTGGCTGGAGGCGAGCGCGAGGCTTTCGGCAAAGGCAGT 
CGAGTGTTTGCAGACCGGGGCGAGTCCTGTGAAAGCAGATAAAAGAAAACATTTATTAACGT 
GTCATTACGAGGGGAGCGCCCGGCCGGGGCTGTCGCACTCCCCGCGGAACATTTGGCTCCCT 
CCAGCTCCGAGAGAGGAGAAGAAGAAAGCGGAAAAGAGGCAGATTCACGTCGTTTCCAGCCA 
AGTGGACCTGATCGATGGCCCTCCTGAATTTATCACGATATTTGATTTATTAGCGATGCCCC 
CTGGTTTGTGTGTTACGCACACACACGTGCACACAAGGCTCTGGCTCGCTTCCCTCCCTCGT 
TTCCAGCTCCTGGGCGAATCCCACATCTGTTTCAACTCTCCGCCGAGGGCGAGCAGGAGCGA 
GAGTGTGTCGAATCTGCGAGTGAAGAGGGACGAGGGAAAAGAAACAAAGCCACAGACGCAAC 
TTGAGACTCCCGCATCCCAAAAGAAGCACCAGATCAGCAAAAAAAGAAGATSGGCCCCCCGA 
GCCTCGTGCTGTGCTTGCTGTCCGCAACTGTGTTCTCCCTGCTGGGTGGAAGCTCGGCCTTC 
CTGTCGCACCACCGCCTGAAAGGCAGGTTTCAGAGGGACCGCAGGAACATCCGCCCCAACAT 
CATCCTGGTGCTGACGGACGACCAGGATGTGGAGCTGGGTTCCATGCAGGTGATGAACAAGA 
CCCGGCGCATCATGGAGCAGGGCGGGGCGCACTTCATCAACGCCTTCGTGACCACACCCATG 
TGCTGCCCCTCACGCTCCTCCATCCTC71CTGGCAAG 

CAACAATGAGAACTGCTCCTCGCCCTCCTGGCAGGCACAGCACGAGAGCCGCACCTTTGCCG 
TGTACCTCAATAGCACTGGCTACCGGACAGCTTTCTTCGGGAAGTATCTTAATGAATACAAC 
GGCTCCTACGTGCCACCCGGCTGGAAGGAGTGGGTCGGACTCCTTAAAAACTCCCGCTTTTA 
TAACTACACGCTGTGTCGGAACGGGGTGAAAGAGAAGCACGGCTCCGACTACTCCAAGGATT 
ACCTCACAGACCTCATCACCAATGACAGCGTGAGCTTCTTCCGCACGTCCAAGAAGATGTAC 
CCGCACAGGCCAGTCCTCATGGTCATCAGCCATGCAGCCCCCCACGGCCCTGAGGATTCAGC 
CCCACAATATTCACGCCTCTTCCCAAACGCATCTCAGCACATCACGCCGAGCTACAACTACG 
CGCCCAACCCGGACAAACACTGGATCATGCGCTACACGGGGCCCATGAAGCCCATCCACATG 
GAATTCACCAACATGCTCCAGCGGAAGCGCTTGCAGACCCTCATGTCGGTGGACGACTCCAT 
GGAGACGATTTACAACATGCTGGTTGAGACGGGCGAGCTGGACAACACGTACATCGTATACA 
CCGCCGACCACGGTTACCACATCGGCCAGTTTGGCCTGGTGAAAGGGAAATCCATGCCATAT 
GAGTTTGACATCAGGGTCCCGTTCTACGTGAGGGGCCCCAACGTGGAAGCCGGCTGTCTGAA 
TCCCCACATCGTCCTCAACATTGACCTGGCCCCCACCATCCTGGACATTGCAGGCCTGGACA 
TACCTGCGGATATGGACGGGAAATCCATCCTCAAGCTGCTGGACACGGAGCGGCCGGTGAAT 
CGGTTTCACTTGAAAAAGAAGATGAGGGTCTGGCGGGACTCCTTCTTGGTGGAGAGAGGCAA 
GCTGCTACACAAGAGAGACAATGACAAGGTGGACGCCCAGGAGGAGAACTTTCTGCCCAAGT 
ACCAGCGTGTGAAGGACCTGTGTCAGCGTGCTGAGTACCAGACGGCGTGTGAGCAGCTGGGA 
CAGAAGTGGCAGTGTGTGGAGGACGCCACGGGGAAGCTGAAGCTGCATAAGTGCAAGGGCCC 
CATGCGGCTGGGCGGCAGCAGAGCCCTCTCCAACCTCGTGCCCAAGTACTACGGGCAGGGCA 
GCGAGGCCTGCACCTGTGACAGCGGGGACTACAAGCTCAGCCTGGCCGGACGCCGGAAAAAA 
CTCTTCAAGAAGAAGTACAAGGCCAGCTATGTCCGCAGTCGCTCCATCCGCTCAGTGGCCAT 
CGAGGTGGACGGCAGGGTGTACCACGTAGGCCTGGGTGATGCCGCCCAGCCCCGAAACCTCA 
CCAAGCGGCACTGGCCAGGGGCCCCTGAGGACCAAGATGACAAGGATGGTGGGGACTTCAGT 
GGCACTGGAGGCCTTCCCGACTACTCAGCCGCCAACCCCATTAAAGTGACACATCGGTGCTA 
CATCCTAGAGAACGACACAGTCCAGTGTGACCTGGACCTGTACAAGTCCCTGCAGGCCTGGA 
AAGACCACAAGCTGCACATCGACCACGAGATTGAAACCCTGCAGAACAAAATTAAGAACCTG 
AGGGAAGTCCGAGGTCACCTGAAGAAAAAGCGGCCAGAAGAATGTGACTGTCACAAAATCAG 
CTACCACACCCAGCACAAAGGCCGCCTCAAGCACAGAGGCTCCAGTCTGCATCCTTTCAGGA 
AGGGCCTGCAAGAGAAGGACAAGGTGTGGCTGTTGCGGGAGCAGAAGCGCAAGAAGAAACTC 
CGCAAGCTGCTCAAGCGCCTGCAGAACAACGACACGTGCAGCATGCCAGGCCTCACGTGCTT 
CACCCACGACAACCAGCACTGGCAGACGGCGCCTTTCTGGACACTGGGGCCTTTCTGTGCCT 
GCACCAGCGCCAACAATAACACGTACTGGTGCATGAGGACCATCAATGAGACTCACAATTTC 
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FIGURE 46B 

CTCTTCTGTGAATTTGCAACTGGCTTCCTAGAGTACTTTGATCTCAACACAGACCCCTACCA 
GCTGATGAATGCAGTGAACACACTGGACAGGGATGTCCTCAACCAGCTACACGTACAGCTCA 
TGGAGCTGAGGAGCTGCAAGGGTTACAAGCAGTGTAACCCCCGGACTCGAAACATGGACCTG 
GATGGAGGAAGCTATGAGCAATACAGGCAGTTTCAGCGTCGAAAGTGGCCAGAAATGAAGAG 
ACCTTCTTCCAAATCACTGGGACAACTGTGGGAAGGCTGGGAAGGTTAAGAAACAACAGAGG 
TGGACCTCC^AAAACATAGAGGCATCACCTGACTGCACAGGCAATGAAAAACCATGTGGGTG 
ATTTCCAGCAGACCTGTGCTATTGGCCAGGAGGCCTGAGAAAGCAAGCACGCACTCTCAGTC 
AACATGACAGATTCTGGAGGATAAC CAG CAGGAGCAGAGATAACTTCAGGAAGTCCATTTTT 
GCCCCTGCTTTTGCTTTGGATTATACCTCACCAGCTGCACAAAATGCATTTTTTCGTATCAA 
AAAGTCACCACTAACCCTCCCCCAGAAGCTCACAAAGGAAAACGGAGAGAGCGAGCGAGAGA 
GATTTCCTTGGAAATTTCTCCCAAGGGCGAAAGTCATTGGAATTTTTAAATCATAGGGGAAA 
AGCAGTCCTGTTCTAAATCCTCTTATTCTTTTGGTTTGTCACAAAGAAGGAACTAAGAAGCA 
GGACAGAGGCAACGTGGAGAGGCTGAAAACAGTGCAGAGACGTTTGACAATGAGTCAGTAGC 
ACAAAAGAGATG ACATTTAC CTAG CACT AT AAAC C CTGGT TG CCTCTGAAGAAACTGCCTTC 
ATTGTATATATGTGACTATTTACATGTAATCAACATGGGAACTTTTAGGGGAACCTAATAAG 
AAAT C CCAATTTT CAGGAGTGGTGGTGTCAATAAACG CTCTGTGG CCAGTGTAAAAGAAAAA 
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FIGURE 47 

mgppslvlcllsatvfsllggssaflshhrlkgrfqrdrrnirpni ilvltddqdvelgsmq 
vmnktrrimeqggahfinafvttpmccp 

rtfavylnstgyrtaffgkylneyngsyvppgwkewgllknsrfynytlcrngvkekhgsd 
yskdyltdlitndsvsffrtskkmyphrpviwishaaphgpedsapqysrlfpnasqhitp 
syotapnpdkhwimrytgpmkpihmeftnmlqr 

yiwtadhgyhigqfglvkgksmpyefdirvpfyvrgpnveagclnphivlnidlaptildi 
agld i p admdgks i lklldterpvnrfhlkkkmrvwrds flvergkllhkrdndkvdaqeen 
flpkyqrvkdlcqraeyqtaceqlgqkwqcvedatgklkljikckgpmrlggsralsnlvpky 
ygqgseactcdsgdykxjsiagrrkklfkiocykasytosrsirsvaietogrvyhvglgdaaq 
prnltkrhwpgapedqddkdggdfsgtgglpdys aanp i kvthrcy i lendtvqcdldlyks 
lq awkdhklhi dhei etlqnk i knlre vrghl kkkr peecdchk i s yhtqhkgrlkhrgs s l 
hpfrkglqekdkvwllreqkrkjcklrkiilkrl^ 
pfcactsannntywovirtinethnflfc^ 

hvqlmelrsckgykqcnprtrnmdldggsyeqyrqfqrrkwpemkrpsskslgqlwegweg 



WO 99/63088 PCT/US99/12252 

FIGURE 48 

AACAAAGTTCAGTGACTGAGAGGGCTGAGCGGAGGCTGCTGAAGGGGAGAAAGGAGTGAGGA 

GCTGCTGGGCAGAGAGGGACTGTCCGGCTCCCAGATGCTGGGCCTCCTGGGGAGCACAGCCC 

TCGTGGGATGGATCACAGGTGCTGCTGTGGCGGTCCTGCTGCTGCTGCTGCTGCTGGCCACC 

TGCCTTTTCCACGGACGGCAGGACTGTGACGTGGAGAGGAACCGTACAGCTGCAGGGGGAAA 

CCGAGTCCGCCGGGCCCAGCCTTGGCCCTTCCGGCGGCGGGGCCACCTGGGAATCTTTCACC 

ATCACCGTCATCCTGGCCACGTATCTCATGTGCCGAATGTGGGCCTCCACCACCACCACCAC 

CCCCGCCACACCCCTCACCACCTCCACCACCACCACCACCCCCACCGCCACCATCCCCGCCA 

CGCTCGCTgAGGCTGCTGTCGCCGGTGCCTGTGGACAGCAGCTGCCCCTGCCCTCCCATCTG 

TTCCCAGGACAAGTGGACCCCATGTTTCCATGTGGAAGGATGCATCTCTGGGGTGAACGAGG 

GGAACAATAGACTGGGGCTTGCTCCAGCTGCATTTGCATGGCATGCCCCAGTGTACTATGGC 

AGCAGAGAATGGAGGAACACTGGGTCTGCAGTGCTGAAGGGTTTGGGGAGTGGAGAGCAAGG 

GTGCTCTTTCGGGGCTGGACAGCCCGTCTTGTGACAGTGACTCCCAGTGAGCCCCAGAAATG 

ACAAGCGTGTCTTGGCAGAGCCAGCACACAAGTGGATGTGAAGTGCCCGTCTTGACCTCCTC 

ATCAGGCTGCTGCAGGCCTCTGGCGGGCAGGGCACTGGGAGAGGCCCTGAGAATGTCCTTTT 

GGTTTGGAGAAGGCAGTGTGAGGCTGCACAGTCAATTCATCGGTGCCTTAGTCCAAGAAAAT 

AAAAACCACTAAGAAGCTTTAAAAAAAAAAAAAAAAAAAAA 
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MLGLLGSTALVGWITGAAVAVLLLLLLIiATCLFHGRQDCDVERNRTAAGGNRVRRAQPWPFR 
RRGHLGIFHHHRHPGHVSHVPNVGUiHH^^PRHTPHHLHHHHHPHRHHPRHAR 
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FIGURE 50 

GGCGGCTGCTGAGCTGCCTTGAGGTGCAGTGTTGGGGATCCAGAGCCATGTCGGACCTGCTA 

CTACTGGGCCTGATTGGGGGCCTGACTCTCTTACTGCTGCTGACGCTGCTGGCCTTTGCCGG 

GTACTCAGGGCTACTGGCTGGGGTGGAAGTGAGTGCTGGGTCACCCCCCATCCGCAACGTCA 

CTGTGGCCTACAAGTTCCACATGGGGCTCTATGGTGAGACTGGGCGGCTTTTCACTGAGAGC 

TGCAGCATCTCTCCCAAGCTCCGCTCCATCGCTGTCTACTATGACAACCCCCACATGGTGCC 

CCCTGATAAGTGCCGATGTGCCGTGGGCAGCATCCTGAGTGAAGGTGAGGAATCGCCCTCCC 

CTGAGCTCATCGACCTCTACCAGAAATTTGGCTTCAAGGTGTTCTCCTTCCCGGCACCCAGC. 

CATGTGGTGACAGCCACCTTCCCCTACACCACCATTCTGTCCATCTGGCTGGCTACCCGCCG 

TGTCCATCCTGCCTTGGACZACCTACATCAAGGAGCGGAAGCTGTGTGCCTATCCTCGGCTGG 

AGATCTACCAGGAAGACCAGATCCATTTCATGTGCCCACTGGCACGGCAGGGAGACTTCTAT 

GTGCCTGAGATGAAGGAGACAGAGTGGAAATGGCGGGGGCTTGTGGAGGCCATTGACACCCA 

GGTGGATGGCACAGGAGCTGACACAATGAGTGACACGAGTTCTGTAAGCTTGGAAGTGAGCC 

CTGGCAGCCGGGAGACTTCAGCTGCCACACTGTCACCTGGGGCGAGCAGCCGTGGCTGGGAT 

GACGGTGACACCCGCAGCGAGCACAGCTACAGCGAGTCAGGTGCCAGCGGCTCCTCTTTTGA 

GGAGCTGGACTTGGAGGGCGAGGGGCCCTTAGGGGAGTCACGGCTGGACCCTGGGACTGAGC 

CCCTGGGGACTACCAAGTGGCTCTGGGAGCCCACTGCCCCTGAGAAGGGCAAGGAGTAACCC 

ATGGCCTGCACCCTCCTGCAGTGCAGTTGCTGAGGAACTGAGCAGACTCTCCAGCAGACTCT 

CCAGCCCTCTTCCTCCTTCCTCTGGGGGAGGAGGGGTTCCTGAGGGACCTGACTTCCCCTGC 

TCCAGGCCTCTTGCTAAGCCTTCTCCTCACTGCCCTTTAGGCTCCCAGGGCCAGAGGAGCCA 

GGGACTATTTTCTGCACCAGCCCCCAGGGCTGCCGCCCCTGTTGTGTCTTTTTTTCAGACTC 

ACAGTGGAGCTTCCAGGACCCAGAATAAAGCCAATGATTTACTTGTTTCACCTGGAAAAAAA 
AAAAAAAAAA 
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FIGURE 51 

MSDLLLLGLIGGLTLLLLLTLLAFAGYSGLLAGVEVSAGSPPIRNVTVAYKFHMGLYGETGR 
LFTESCSISPKLRSIAVYYDNPHMVPPDKCRCAVGSILSEGEESPSPELIDLYQKFGFKVFS 
FPAPSHWTATFPYTTILSIWLATRRVHPALDTYIKERKLCAYPRLEIYQEDQIHFMCPLAR 
QGDFYVPEMKETEWKWRGLVEAIDTQVDGTGADTMSDTSSVSLEVSPGSRETSAATLSPGAS 
SRGWDDGDTRSEHSYSESGASGSSFEEIi3LEGEGPL^ESRIi3PGTEPLGTTKWLWEPTAPEKGKE 
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FIGURE 52 

CCGCGGGAACGCTGTCCTGGCTGCCGCCACCCGAACAGCCTGTCCTGGTGCCCCGGCTCCCT 
GCCCCGCGCCCAGTCATGACCCTGCGCCCCTCACTCCTCCCGCTCCATCTGCTGCTGCTGCT 
GCTGCTCAGTGCGGCGGTGTGCCGGGCTGAGGCTGGGCTCGAAACCGAAAGTCCCGTCCGGA 
CCCTCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACCATGTGCCGAGCCCGCTGCTTTTGGA 
GACACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACGTATTATTGACACCTCCCT 
GACCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGATTCCAGGTCTGGAGCAGA 
GTCTTCTCGACATGTGTGTGGGAGAGAAGCGAAGGGCAATCATTCCTTCTCACTTGGCCTAT 
GGAAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGTGCAGTATGACGTGGAGCT 
GATTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGGGCATTTTGCCTCTGGTAG 
GGATGGCCATGGTGCCAGCCCTCCTGGGCCTCATTGGGTATCACCTATACAGAAAGGCCAAT 
AG AC C C AAAG TC TC CAAAAAGAAG CT CAAGGAAGAGAAACGAAA CAAGAG CAAAAAGAAATA 
ATAAATAATAAATTTTAAAAAACTTAAAAAAAAAAAAAAAAAA 
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FIGURE 53 

MTLRPSLLPLHLLLLLLLSAAVCRAEAGLETESPVRTLQVETLVEPPEPCAEPAAFGDTLHI 
HYTGSLVDGRIIDTSLTRDPLVIELGQKQVIPGLEQSLLDMCVGEKRRAI IPSHLAYGKRGF 
PPSVPADAWQYDVEL I ALIRANYWLKLVKG I LPLVGMAMVP ALLGLI GYHLYRKANRPKVS 
KKKLKEEKRNKSKKK 
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FIGURE 54 

CCCGGGAACGTGTTCCTGGCTGCCGCACCCGAACAGCCTGTCCTGGTGCCCCGGCTCCCTGC 
CCCGCGCCCAGTCATGACCCTGCGCCCCTCACTCCTCCCGCTCCATCTGCTGCTGCTGCTGC 
TGCTCAGTGCGGCGGTGTGCCGGGCTGAGGCTGGGCTCGAAACCGAAAGTCCCGTCCGGACC 
CTCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACCATGTGCCGAGCCCGCTGCTTTTGGAGA 
CACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACGTATTATTGACACCTCCCTGA 
CCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGATTCCAGGTCTGGAGCAGAGT 
CTTCTCGACATGTGTGTGGGAGAGAAGCGAAGGGCAATCATTCCTTCTCACTTGGCCTATGG 
AAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGTGCAGTATGACGTGGAGCTGA 
TTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGGGCATTTTGCCTCTGGTAGGG 
ATGGCCATGGTGCCACCCTCCTGGGCCTCATTGGGTATCACCTATACAGAAAGGCCAATAGA 
CCCAAAGTCTCCAAAAAGAAGCTCAAGGAAGAGAAACGAAACAAGAGCAAAAAGAAATAATA 
AATAATAAATTTTAAAAAACTTA 
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FIGURE 55 

CCGAAAGTCCCGTCCGGACCCTCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACCATGTGCC 
GAGCCCGCTGCTTTTGGAGACACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACG 
TATTATTGACACCTCCCTGACCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGA 
TTCCAGGTCTGGAGCAGAGTCTTCTCGACATGTGTGTGGGAGAGAAGCGAAGGGCAATCATT 
CCTTCTCACTTGGCCTATGGAAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGT 
GCAGTATGACGTGGAGCTGATTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGG 
GCATTTTGCCTCTGGTAGGGATGGCCATGGTGCCAGCCCTCCTGGGCCTCATTGGGTATCAC 
CiATACAGAAAGGCCAATAGACCCAAAGTCTCCAAAAAGAAGCTCAAGGAAGAGAAACGAAA 
CAAG AG C AAAAAG AAAT AATAAATAAT AAATTTT AAAAAA CTTAAAA 
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FIGURE 56 

CTGCTGCATCCGGGTGTCTGGAGGCTGTGGCCGTTTTGTTTTCTTGGCTAAAATCGGGGGAG 

TGAGGCGGGCCGGCGCGGCGCGACACCGGGCTCCGGAACCACTGCACGACGGGGCTGGACTG 

ACCTGAAAAAAATGTCTGGATTTCTAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGG 

GAAAAGCGCAATACTATTGCTTCCATTGCTGCTGGTGTACTATTTTTTACAGGCTGGTGGAT 

TATCATAGATGCAGCTGTTATTTATCCCACCATGAAAGATTTCAACCACTCATACCATGCCT 

GTGGTGTTATAGCAACCATAGCCTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGA 

GGTGATAGTTACAGTGAAGGTTGTCTGGGTCAAACAGGTGCTCGCATTTGGCTTTTCGTTGG 

TTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTTATGTTG 

CTAAAGAAAAAGACATAGTATACCCTGGAATTGCTGTATTTTTCCAGAATGCCTTCATCTTT 

rTTGGAGGGCTGGTTTTTAAGTTTGGCCGCACTGAAGACTTATGGCAGTGAACACATCTGAT 

TTCCCACAGCACAACAGCCCTGCATGGGTTTGTTTGTTTTTTTACTGCTCACTCCCAACCTT 

TTGTAATGCCATTTTCTAAACTTATTTCTGAGTGTAGTCTCAGCTTAAAGTTGTGTAATACT 

AAAATCACGAGAACACCTAAACAACAACCAAAAATCTATTGTGGTATGCACTTGATTAACTT 

ATAAAATGTTAGAGGAAACTTTCACATGAATAATTTTTGTCAAATTTTATCATGGTATAATT 

TGTAAAAATAAAAAGAAATTACAAAAGAAATTATGGATTTGTCAATGTAAGTATTTGTCATA 

TCTGAGGTCCAAAACCACAATGAAAGTGCTCTGAAGATTTAATGTGTTTATTCAAATGTGGT 

CTCTTCTGTGTCAAATGTTAAATGAAATATAAACATTTTTTAGTTTTTAAAATATTCCGTGG 

TCAAAATTCTTCCTCACTATAATTGGTATTTACTTTTACCAAAAATTCTGTGAACATGTAAT 

GTAACTGGCTTTTGAGGGTCTCCCAAGGGGTGAGTGGACGTGTTGGAAGAGAGAAGCACCAT 

GGTCCAGCCACCAGGCTCCCTGTGTCCCTTCCATGGGAAGGTCTTCCGCTGTGCCTCTCATT 

CCAAGGGCAGGAAGATGTGACTCAGCCATGACACGTGGTTCTGGTGGGATGCACAGTCACTC 

CACATCCACCACTG 
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FIGURE 57 

MSGFLEGLRCSECIDWGEKRNTIASIAAGVLFFTGWWI IIDAAVIYPTMKDFNHSYHACGVI 
ATIAFLMINAVSNGQVRGDSYSEGCLGQTGARIWLFVGFMLAFGSLIASMWILFGGYVAKEK 
DIVYPGIAVFFQNAFIFFGGLVFKFGRTEDLWQ 
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FIGURE 58 

TTCTTGGCTAAAATCGGGGGAGTGAGGCGGGCCGGCGCGGCGCGACACCGGGCTCCGGAACC 
ACTGCACGACGGGGCTGGACTGACCTGAAAAAAATGTCTGGATTTCTAGAGGGCTTGAGATG 
CTCAGAATGCATTGACTGGGGGGAAAAGCGCAATACTATTGCTTCCATTGCTGCTGGTGTAC 
TATTTTTTACAGGCTGGTGGATTATCATAGATGCAGCTGTTATTTATCCCACCATGAAAGAT 
TTCAACCACTCATACCATGCCTGTGGTGTTATAGCAACCATAGCCTTCCTAATGATTAATGC 
AGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTTGTCTGGGTCAAACAGGTG 
CTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGG 
ATTCTTTTTGGAGGTTATGTTGCTAAAGAAAAAGACATAGTATACCCTGGAATTGCTGTATT 
TTTCCAGAATGCCTTCATCTTTTTTGGAGGGCTGGTTTTTAAGTTTGGC 
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FIGURE 59 

TGGACGGACCTGAAAAAAATGTTTGGATTTNTAGAGGGNTTGAGATGTTCAGAATGCATGAC 
TGGGGGAAAAGCGCAAATACTATTGCTTCCATTGCTGCTGGTGTANTATTTTTTACAGGCTG 
GTGGATTATCATAGATGCAGNTGTTATTTATCCCACCATGAAAGATTTCAACCANTCATACC 
ATGCCTGTGGTGTTATAGCAACCATAGCCTTCNTAATGATTAATGCAGTATCGAATGGACAA 
GTCCGAGGTGATAGTTACAGTGAAGGTTGTTTGGGTCAAACAGGTGCTCGCATTTGGCTTTT 
CGTTGGTTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTT 
ATGTTGCTAAAGAAAAAGACATAGTATACCCTGGAATTGNTGTATTTTTCCAGAATGCCTTC 
ATCTTTTTTGGAGGGCTGGTTTTTAAGTTTGGCCGCACTGAAGANTTATGGCAGTG 
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FIGURE 60 

GGACACCGGGTTCCGGACCAATGCANGACGGGGTGGANTGACCTGAAAAAAATGTTTGGATT 
TTTAGAGGGCTTGAGATGNTCAGAATGCATTGACTGGGGGAAAAGCGCAATANTATTGCTTT 
CCATTGCTGCTGGTGTACTATTTTTTACAGGGTGGTGGATTATCATAGATGCAGCTGTTATT 
TATCCCACCATGAAAGATTTNAACCACTCATACCATGCCTGTGGTGTTATAGCAACCATAGC 
CTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTT 
GTTTGGGTCAAACAGGTGNTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGGATTT 
CTGATTGNATTCTATGCGGATTCTTCTTGGAGGTTATGTTGCTAAAGAAAAAGACATAGTAT 
ACCCTGGAATTNCTNTATTTTTCCAGAATGCC 
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FIGURE 61 

TAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGGGAAAAGCGCAATANTATTGCTTCC 
ATTGNTGNTGGTGTANTATTTTTTTACAGGCTGGTGGATTATNATAGATGCAGCTGTTATTT 
ATCCCACCATGAAAGATTTNAACCANTCATACCATGCCTGTGGTGTTATAGCAACCATAGCC 
TTCCTAATGATTAATGCAGTATNGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTTG 
TTTGGGTCAAACAGGTGNTNGCATTTGGCTTTTNGTTGGTTTCATGTTGGCCTTTGGATCTN 
TGATTGCATTTATGTGGATTNTTTTTGGAGGTTATGTTGCTAAAGNAAAAGACATAGTATAC 
CCTGT 
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FIGURE 62 



GGGAGGCTGTGNCCGTTTTGTTTTNTTGGCTAAAATCGGGGGAGTGAGGCGGCCCGGCGCGG 
CGNGACACCGGGTTCCGGGAACCATTGCACGACGGGGTGGACTGACCTGAAAAAAATGTTTG 
GATTTNTAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGGGAAAAGCGCAATACTATT 
GCTTCCATTGCTGCTGGTGTACTATTTTTTACAGGCTGGTGGATTATCATAGATGCAGCTGT 
TATT TATC C C AC CATG AAAG ATTT CAAC CACT CATAC C ATG C CTGTGGTGTT AT AG CAACC A 
TAGCCTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAA 
GGTTGTCTGGGTCAAACAGGTGCTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGG 
ATNTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTTATGTTGCTAAAGAAAAAGACATAG 
TATACCCTGGAATTGCTGTATTTTTCCAGAATGCCTTCATNTTTTTTGGAGGGCTG 



WO 99/63088 




PCTAJS99H2252 



FIGURE 63 

CGACGCCGGCGTGATGTGGCTTCCGCTGGTGCTGCTCCTGGCTGTGCTGCTGCTGGCCGTCC 
TCTGCAAAGTTTACTTGGGACTATTCTCTGGCAGCTCCCCGAATCCTTTCTCCGAAGATGTC 
AAACGGCCCCCAGCGCCCCTGGTAACTGACAAGGAGGCCAGGAAGAAGGTTCTCAAACAAGC 
TTTTTCAGCCAACCAAGTGCCGGAGAAGCTGGATGTGGTGGTAATTGGCAGTGGCTTTGGGG 
GCCTGGCTGCAGCTGCAATTCTAGCTAAAGCTGGCAAGCGAGTCCTGGTGCTGGAACAACAT 
ACCAAGGCAGGGGGCTGCTGTCATACCTTTGGAAAGAATGGCCTTGAATTTGACACAGGAAT 
CCATTACATTGGGCGTATGGAAGAGGGCAGCATTGGCCGTTTTATCTTGGACCAGATCACTG 
AAGGGCAGCTGGACTGGGCTCCCCTGTCCTCTCCTTTTGACATCATGGTACTGGAAGGGCCC 
AATGGCCGAAAGGAGTACCCCATGTACAGTGGAGAGAAAGCCTACATTCAGGGCCTCAAGGA 
GAAGTTTCCACAGGAGGAAGCTATCATTGACAAGTATATAAAGCTGGTTAAGGTGGTATCCA 
GTGGAGCCCCTCATGCCATCCTGTTGAAATTCCTCCCATTGCCCGTGGTTCAGCTCCTCGAC 
AGGTGTGGGCTGCTGACTCGTTTCTCTCCATTCCTTCAAGCATCCACCCAGAGCCTGGCTGA 
GGTCCTGCAGCAGCTGGGGGCCTCCTCTGAGCTCCAGGCAGTACTCAGCTACATCTTCCCCA 
CTTACGGTGTCACCCCCAACCACAGTGCCTTTTCCATGCACGCCCTGCTGGTCAACCACTAC 
ATGAAAGGAGGCTTTTATCCCCGAGGGGGTTCCAGTGAAATTGCCTTCCACACCATCCCTGT 
GATTCAGCGGGCTGGGGGCGCTGTCCTCACAAAGGCCACTGTGCAGAGTGTGTTGCTGGACT 
^AGCTGGGAAAGCCTGTGGTGTCAGTGTGAAGAAGGGGCATGAGCTGGTGAACATCTATTGC 
CCCATCGTGGTCTCCAACGCAGGACTGTTCAACACCTATGAACACCTACTGCCGGGGAACGC 
CCGCTGCCTGCCAGGTGTGAAGCAGCAACTGGGGACGGTGCGGCCCGGCTTAGGCATGACCT 
CTGTTTTCATCTGCCTGCGAGGCACCAAGGAAGACCTGCATCTGCCGTCCACCAACTACTAT 
GTTTACTATGACACGGACATGGACCAGGCGATGGAGCGCTACGTCTCCATGCCCAGGGAAGA 
GGCTGCGGAACACATCCCTCTTCTCTTCTTCGCTTTCCCATCAGCCAAAGATCCGACCTGGG 
AGGACCGATTCCCAGGCCGGTCCACCATGATCATGCTCATACCCACTGCCTACGAGTGGTTT 
G AGG AGTGG C AG G C GG AG CTGAAGGG AAAG CGGGGC AG TGACTATGAGAC CTTCAAAAACT C 
CTTTGTGGAAGCCTCTATGTCAGTGGTCCTGAAACTGTTCCCACAGCTGGAGGGGAAGGTGG 
AGAGTGTGACTGCAGGATCCCCACTCACCAACCAGTTCTATCTGGCTGCTCCCCGAGGTGCC 
TGCTACGGGGCTGACCATGACCTGGGCCGCCTGCACCCTTGTGTGATGGCCTCCTTGAGGGC 
CCAGAGCCCCATCCCCAACCTCTATCTGACAGGCCAGGATATCTTCACCTGTGGACTGGTCG 
GGGCCCTGCAAGGTGCCCTGCTGTGCAGCAGCGCCATCCTGAAGCGGAACTTGTACTCAGAC 
CTTAAGAATCTTGATTCTAGGATCCGGGCACAGAAGAAAAAGAATTASTTCCATCAGGGAGG 
AGTCAGAGGAATTTGCCCAATGGCTGGGGCATCTCCCTTGACTTACCCATAATGTCTTTCTG 
CATTAGTTCCTTGCACGTATAAAGCACTCTAATTTGGTTCTGATGCCTGAAGAGAGGCCTAG 
TTTAAATCACAATTCCGAATCTGGGGCAATGGAATCACTGCTTCCAGCTGGGGCAGGTGAGA 
TCTTTACGCCTTTTATAACATGCCATCCCTACTAATAGGATATTGACTTGGATAGCTTGATG 
TCTCATGACGAGCGGCGCTCTGCATCCCTCACCCATGCCTCCTAACTCAGTGATCAAAGCGA 
ATATTCCATCTGTGGATAGAACCCCTGGCAGTGTTGTCAGCTCAACCTGGTGGGTTCAGTTC 
TGTCCTGAGGCTTCTGCTCTCATTCATTTAGTGCTACGCTGCACAGTTCTACACTGTCAAGG 
GAAAAGGGAGACTAATGAGGCTTAACTCAAAACCTGGGCGTGGTTTTGGTTGCCATTCCATA 
GGTTTGGAGAGCTCTAGATCTCTTTTGTGCTGGGTTCAGTGGCTCTTCAGGGGACAGGAAAT 
GCCTGTGTCTGGCCAGTGTGGTTCTGGAGCTTTGGGGTAACAGCAGGATCCATCAGTTAGTA 
GGGTGCATGTCAGATGATCATATCCAATTCATATGGAAGTCCCGGGTCTGTCTTCCTTATCA 
TCGGGGTGGCAGCTGGTTCTCAATGTGCCAGCAGGGACTCAGTACCTGAGCCTCAATCAAGC 
CTTATCCACCAAATACACAGGGAAGGGTGATGCAGGGAAGGGTGACATCAGGAGTCAGGGCA 
TGGACTGGTAAGATGAATACTTTGCTGGGCTGAAGCAGGCTGCAGGGCATTCCAGCCAAGGG 
CACAGCAGGGGACAGTGCAGGGAGGTGTGGGGTAAGGGAGGGAAGTCACATCAGAAAAGGGA 
AAGCCACGGAATGTGTGTGAAGCCCAGAAATGGCATTTGCAGTTAATTAGCACATGTGAGGG 
TTAGACAGGTAGGTGAATGCAAGCTCAAGGTTTGGAAAAATGACTTTTCAGTTATGTCTTTG 
GTATCAGACATACGAAAGGTCTCTTTGTAGTTCGTGTTAATGTAACATTAATAAATTTATTG 
ATTCCATTGCTTTAAAAAAAAAAAAAAA 
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FIGURE 64 

MWLPLVLLIAVLLLAVLCKVYLGLFSGSSPNPFSEDVKRPPAPLVTDKEARKKVLKQAFSAN 
QVPEKLDVWIGSGFGGLAAAAILAKA^ 

RMEEGSIGRFILDQITEGQLDWAPLSSPFDIMVLEGPNGRKEYPMYSGEKAYIQGLKEKFPQ 
EEAIIDKYIKLVKVVSSGAPHAILLKFLPLPVVQLLDRCGLLTRFSPFLQASTQSLAEVLQQ 
LGASSELQAVLSYIFPTYGVTPNHSAFSMHALLVNHYMKGGFYPRGGSSEIAFHTIPVIQRA 
GGAVLTKATVQSVLLDSAGKACGVSVKKGHELVNIYCPIVVSNAGLFNTYEHLLPGNARCLP 
GVKQQLGTVRPGLGMTSVF I CLRGTKEDLHLPSTNYYVYYDTDMDQAMERYVSMPREEAAEH 
I PLL FF AF PS AKDPTWEDRF PGRS TM I ML I PTAYEWFEE WQAELKGKRGSDYETFKNS FVEA 
SMSVVLKLFPQLEGKVESVTAGSPLTNQFYLAAPRGACYGADHDLGRLHPC^T^SLRAQSPI 
PKLYLTGQDIFTCGLVGALQGALLCSSAILKRNLYSDLKNLDSRIRAQKKKN 
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FIGURE 65 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCTA 

GGGGTTGGCACCGGCCCCGAGAGGAGGATGCGGGTCCGGATAGGGCTGACGCTGCTGCTGTG 

TGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGGATGAAGAAGGCAGCCAGGATGAATCCT 

TAGATTCCAAGACTACTTTGACATCAGATGAGTCAGTAAAGGACCATACTACTGCAGGCAGA 

GTAGTTGCTGGTCAAATATTTCTTGATTCAGAAGAATCTGAATTAGAATCCTCTATTCAAGA 

AGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAGAAGATATCAGCTTTCTAG 

AGT CTC CAAAT C CAGAAAAC AAGGACTATG AAGAGC CAAAGAAAGTACGGAAACCAG CTTTG 

ACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCTnTCCTAGATAA 

GGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTGCTACAACCT 

ATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCTAAGAGA 

CGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAGCAA 

TAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 

CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATC 

CAGGCAGCGAGAGAGATGTTTGAGAAGCTGACTGAGGAAGGCTCTCCCAAGGGACAGACTGC 

TCTTGGCTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTAT 

ATTATACATTTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTT 

TAGTGG AAGG CT AAT AAT ATT AAC AT CAGAAG AATTT G TG GT TT AT AG CGG CCACAACTTTT 

TCAGCTTTCATGATCCAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTGCTTCAA 

ATTCTTGTTAATGGATATAACACATGGAATCTACATGTAAATGAAAGTTGGTGGAGTCCACA 

ATTTTTCTTTAAAATGATTAGTTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGC 

TCTTTTTAAATTTTCTCTGAGTTGGAATTGTCAGAATCATTTTTTACATTAGATTATCATAA 

TTTTAAAAATTTTTCTTTAGTTTTTCAAAATTTTGTAAATGGTGGCTATAGAAAAACAACAT 

GAAATATTATACAATATTTTGCAACAATGCCCTAAGAATTGTTAAAATTCATGGAGTTATTT 

GTGCAGAATGACTCCAGAGAGCTCTACTTTCTGTTTTTTACTTTTCATGATTGGCTGTCTTC 

CCATTTATTCTGGTCATTTATTGCTAGTGACACTGTGCCTGCTTCCAGTAGTCTCATTTTCC 

CTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGGAAGATTAACTCATTTTTAATAAA 

ATTATGTCTAAGATTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 66 

MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLD 
SEES2LESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHG 
EPCHFPFLFLDKEYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMM 
YQTGMKILNGSNKKSQKREAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEK 
LTEEGSPKGQTALGFLYASGLGWSSQAKALVYYTFGALGGNLIAHMVLVSRL 
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FIGURE 67 

CTTCCCAGCCCTGTGCCCCAAAGCACCTGGAGCATATAGCCTTGCAGAACTTCTACTTGCCT 
GCCTCCCTGCCTCTGGCCATGGCCTGCCGGTGCCTCAGCTTCCTTCTGATGGGGACCTTCCT 
GTCAGTTTCCCAGACAGTCCTGGCCCAGCTGGATGCACTGCTGGTCTTCCCAGGCCAAGTGG 
CTCAACTCTCCTGCACGCTCAGCCCCCAGCACGTCACCATCAGGGACTACGGTGTGTCCTGG 
TACCAGCAGCGGGCAGGCAGTGCCCCTCGATATCTCCTCTACTACCGCTCGGAGGAGGATCA 
CCACCGGCCTGCTGACATCCCCGATCGATTCTCGGCAGCCAAGGATGAGGCCCACAATGCCT 
GTGTCCTCACCATTAGTCCCGTGCAGCCTGAAGACGACGCGGATTACTACTGCTCTGTTGGC 
TACGGCTTTAGTCCCTAGGGGTGGGGTGTGAGATGGGTGCCTCCCCTCTGCCTCCCATTTCT 
GCCCCTGACCTTGGGTCCCTTTTAAACTTTCTCTGAGCCTTGCTTCCCCTCTGTAAAATGGG 
1 TAATAATATTCAACATGTCAACAAC 
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FIGURE 68 

MACR CLS FLLMGTFLSVS QTVLAQLDALLV? PGQ VAQLS CTLS PQHVT I RD YGVS WYQQRAG 
SAPRYLLYYRSEEDHHRPADI PDRFSAAKDEAHNACVLT I S PVQPEDDADYYCS VGYGFS P 
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FIGURE 69 

GCCGCCCCGCCCCGAGACCGGGCCCGGGGGCGCGGGGCGGCGGGATGCGGCGCCCGGGGCGG 
CGATGACCGCGGAGCGCACGCCGCGGGCCCGGCCCTGACCCCGCCGCCCGCCCGCTGAGCCC 
CCCGCCGAGGTCCGGACAGGCCGAGATGACGCCGAGCCCCCTGTTGCTGCTCCTGCTGCCGC 
CGCTGCTGCTGGGGGCCTTCCCACCGGCCGCCGCCGCCCGAGGCCCCCCAAAGATGGCGGAC 
AAGGTGGTCCCACGGCAGGTGGCCCGGCTGGGCCGCACTGTGCGGCTGCAGTGCCCAGTGGA 
GGGGGACCCGCCGCCGCTGACCATGTGGACCAAGGATGGCCGCACCATCCACAGCGGCTGGA 
GCCGCTTCCGCGTGCTGCCGCAGGGGCTGAAGGTGAAGCAGGTGGAGCGGGAGGATGCCGGC 
GTGTACGTGTGCAAGGCCACCAACGGCTTCGGCAGCCTGAGCGTCAACTACACCCTCGTCGT 
GCTGGATGACATTAGCCCAGGGAAGGAGAGCCTGGGGCCCGACAGCTCCTCTGGGGGTCAAG 
AGGACCCCGCCAGCCAGCAGTGGGCACGACCGCGCTTCACACAGCCCTCCAAGATGAGGCGC 
CGGGTGATCGCACGGCCCGTGGGTAGCTCCGTGCGGCTCAAGTGCGTGGCCAGCGGGCACCC 
TCGGCCCGACATCACGTGGATGAAGGACGACCAGGCCTTGACGCGCCCAGAGGCCGCTGAGC 
CCAGGAAGAAGAAGTGGACACTGAGCCTGAAGAACCTGCGGCCGGAGGACAGCGGCAAATAC 
ACCTGCCGCGTGTCGAACCGCGCGGGCGCCATCAACGCCACCTACAAGGTGGATGTGATCCA 
GCGGACCCGTTCCAAGCCCGTGCTCACAGGCACGCACCCCGTGAACACGACGGTGGACTTCG 
GGGGGACCACGTCCTTCCAGTGCAAGGTGCGCAGCGACGTGAAGCCGGTGATCCAGTGGCTG 
AAGCGCGTGGAGTACGGCGCCGAGGGCCGCCACAACTCCACCATCGATGTGGGCGGCCAGAA 
GTTTGTGGTGCTGCCCACGGGTGACGTGTGGTCGCGGCCCGACGGCTCCTACCTCAATAAGC 
TGCTCATCACCCGTGCCCGCCAGGACGATGCGGGCATGTACATCTGCCTTGGCGCCAACACC 
ATGGGCTACAGCTTCCGCAGCGCCTTCCTCACCGTGCTGCCAGACCCAAAACCGCCAGGGCC 
ACCTGTGGCCTCCTCGTCCTCGGCCACTAGCCTGCCGTGGCCCGTGGTCATCGGCATCCCAG 
CCGGCGCTGTCTTCATCCTGGGCACCCTGCTCCTGTGGCTTTGCCAGGCCCAGAAGAAGCCG 
TGCACCCCCGCGCCTGCCCCTCCCCTGCCTGGGCACCGCCCGCCGGGGACGGCCCGCGACCG 
CAGCGGAGACAAGGACCTTCCCTCGTTGGCCGCCCTCAGCGCTGGCCCTGGTGTGGGGCTGT 
GTGAGGAGCATGGGTCTCCGGCAGCCCCCCAGCACTTACTGGGCCCAGGCCCAGTTGCTGGC 
CCTAAGTTGTACCCCAAACTCTACACAGACATCCACACACACACACACACACACTCTCACAC 
ACACTCACACGTGGAGGGCAAGGTCCACCAGCACATCCACTATCAGTGCTAGACGGCACCGT 
ATCTGCAGTGGGCACGGGGGGGCCGGCCAGACAGGCAGACTGGGAGGATGGAGGACGGAGCT 
GCAGACGAAGGCAGGGGACCCATGGCGAGGAGGAATGGCCAGCACCCCAGGCAGTCTGTGTG 
TGAGGCATAGCCCCTGGACACACACACACAGACACACACACTACCTGGATGCATGTATGCAC 
ACACATGCGCGCACACGTGCTCCCTGAAGGCACACGTACGCACACGCACATGCACAGATATG 
CCGCCTGGGCACACAGATAAGCTGCCCAAATGCACGCACACGCACAGAGACATGCCAGAACA 
TACAAGGACATGCTGCCTGAACATACACACGCACACCCATGCGCAGATGTGCTGCCTGGACA 
CACACACACACACGGATATGCTGTCTGGACGCACACACGTGCAGATATGGTATCCGGACACA 
CACGTGCACAGATATGCTGCCTGGACACACAGATAATGCTGCCTTGACACACACATGCACGG 
ATATTGCCTGGACACACACACACACACACGCGTGCACAGATATGCTGTCTGGACACGCACAC 
ACATGCAGATATGCTGCCTGGACACACACTTCCAGACACACGTGCACAGGCGCAGATATGCT 
GCCTGGACACACGCAGATATGCTGTCTAGTCACACACACACGCAGACATGCTGTCCGGACAC 
ACACACGCATGCACAGATATGCTGTCCGGACACACACACGCACGCAGATATGCTGCCTGGAC 
ACACACACAGATAATGCTGCCTCAACACTCACACACGTGCAGATATTGCCTGGACACACACA 
TGTGCACAGATATGCTGTCTGGACATGCACACACGTGCAGATATGCTGTCCGGATACACACG 
CACGCACACATGCAGATATGCTGCCTGGGCACACACTTCCGGACACACATGCACACACAGGT 
GCAGATATGCTGCCTGGACACACACACAGATAATGCTGCCTCAACACTCACACACGTGCAGA 
TATTGCCTGGACACACACATGTGCACAGATATGCTGTCTGGACATGCACACACGTGCAGATA 
TGCTGTCCGGATACACACGCACGCACACATGCAGATATGCTGCCTGGGCACACACTTCCGGA 
CACACATGCACACACAGGTGCAGATATGCTGCCTGGACACACGCAGACTGACGTGCTTTTGG 
GAGGGTGTGCCGTGAAGCCTGCAGTACGTGTGCCGTGAGGCTCATAGTTGATGAGGGACTTT 
CCCTGCTCCACCGTCACTCCCCCAACTCTGCCCGCCTCTGTCCCCGCCTCAGTCCCCGCCTC 
CATCCCCGCCTCTGTCCCCTGGCCTTGGCGGCTATTTTTGCCACCTGCCTTGGGTGCCCAGG 
AGTCCCCTACTGCTGTGGGCTGGGGTTGGGGGCACAGCAGCCCCAAGCCTGAGAGGCTGGAG 
CCCATGGCTAGTGGCTCATCCCCAGTGCATTCTCCCCCTGACACAGAGAAGGGGCCTTGGTA 
TTTATATTTAAGAAATGAAGATAATATTAATAATGATGGAAGGAAGACTGGGTTGCAGGGAC 
TGTGGTCTCTCCTGGGGCCCGGGACCCGCCTGGTCTTTCAGCCATGCTGATGACCACACCCC 
GTCCAGGCCAGACACCACCCCCCACCCCACTGTCGTGGTGGCCCCAGATCTCTGTAATTTTA 
TGTAGAGTTTGAGCTGAAGCCCCGTATATTTAATTTATTTTGTTAAACACAAAA 
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FIGURE 70 

MTPSPLLLLLLPPLLLGAFPPAAAARGPPKMADKVVPRQVARLGRTVRLQCPVEGDPPPLTM 
OTKDGRTIHSGWSRFRVLPQGLKVKQVEREDAGVYVCKATNGFGSLSVNYTLVVLDDISPGK 
ESLGPDSSSGGQEDPASQQWARPRFTQPSKMRRRVIARPVGSSVRLKCVASGHPRPDITWMK 
DDQALTRPEAAEPRKKKWTLSLKNLRPEDSGKYTCRVSNRAGAINATYKVDVIQRTRSKPVL 
TGTHPWTTVDFGGTTSFQCKVRSDVKPVIQWLKRVEYGAEGRHNSTIDVGGQKFVVLPTGD 
WSRPDGSYLNKLLITRARQDDAGMYICLGANTMGYSFRSAFLTVLPDPKPPGPPVASSSSA 
TSLPWPWIGIPAGAVFILGTLLLWLCQAQKKPCTPAPAPPLPGHRPPGTARDRSGDKDLPS 
IJVALSAGPGVGLCEEHGSPAAPQHLLGPGPVAGPKLYPKLYTDIHTHTHTHSHTHSKVEGKV 
HQHIHYQC 
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FIGURE 71 A 

CCCAGCTGAGGAGCCCTGCTCAAGACACGGTCACTGGATCTGAGAAACTTCCCAGGGGACCG 

CATTCCAGAGTCAGTGACTCTGTGAAGCACCCACATCTACCTCTTGCCACGTTCCCACGGGC 

TTGGGGGAAAGATGGTGGGGACCAAGGCCTGGGTGTTCTCCTTCCTGGTCCTGGAAGTCACA 

TCTGTGTTGGGGAGACAGACGATGCTCACCCAGTCAGTAAGAAGAGTCCAGCCTGGGAAGAA 

GAACCCCAGCATCTTTGCCAAGCCTGCCGACACCCTGGAGAGCCCTGGTGAGTGGACAACAT 

GGTTCAACATCGACTACCCAGGCGGGAAGGGCGACTATGAGCGGCTGGACGCCATTCGCTTC 

TACTATGGGGACCGTGTATGTGCCCGTCCCCTGCGGCTAGAGGCTCGGACCACTGACTGGAC 

ACCTGCGGGCAGCACTGGCCAGGTGGTCCATGGTAGTCCCCGTGAGGGTTTCTGGTGCCTCA 

ACAGGGAGCAGCGGCCTGGCCAGAACTGCTCTAATTACACCGTACGCTTCCTCTGCCCACCA 

GGATCCCTGCGCCGAGACACAGAGCGCATCTGGAGCCCATGGTCTCCCTGGAGCAAGTGCTC 

AGCTGCCTGTGGTCAGACTGGGGTCCAGACTCGCACACGCATTTGCTTGGCAGAGATGGTGT 

CGCTGTGCAGTGAGGCCAGCGAAGAGGGTCAGCACTGCATGGGCCAGGACTGTACAGCCTGT 

GACCTGACCTGCCCAATGGGCCAGGTGAATGCTGACTGTGATGCCTGCATGTGCCAGGACTT 

CATGCTTCATGGGGCTGTCTCCCTTCCCGGAGGTGCCCCAGCCTCAGGGGCTGCTATCTACC 

TCCTGACCAAGACGCCGAAGCTGCTGACCCAGACAGACAGTGATGGGAGATTCCGAATCCCT 

GGCTTGTGCCCTGATGGCAAAAGCATCCTGAAGATCACAAAGGTCAAGTTTGCCCCCATTGT 

ACTCACAATGCC CAAGACTAGCCTGAAGGCAGCCACCATCAAGG CAGAGTTTGTGAGGGCAG 

AGACTCCATACATGGTGATGAACCCTGAGACAAAAGCACGGAGAGCTGGGCAGAGCGTGTCT 

CTGTGCTGTAAGGCCACAGGGAAGCCCAGGCCAGACAAGTATTTTTGGTATCATAATGACAC 

ATTGCTGGATCCTTCCCTCTACAAGCATGAGAGCAAGCTGGTGCTGAGGAAACTGCAGCAGC 

ACCAGGCTGGGGAGTACTTTTGCAAGGCCCAGAGTGATGCTGGGGCTGTGAAGTCCAAGGTT 

GCCCAGCTGATTGTCACAGCATCTGATGAGACTCCTTGCAACCCAGTTCCTGAGAGCTATCT 

TATCCGGCTGCCCCATGATTGCTTTCAGAATGCCACCAACTCCTTCTACTATGACGTGGGAC 

CCTGCCCTGTTAAGACTTGTGCAGGGCAGCAGGATAATGGGATCAGGTGCCGTGATGCTGTG 

CAGAACTGCTGTGGCATCTCCAAGACAGAGGAAAGGGAGATCCAGTGCAGTGGCTACACGCT 

ACCCACCAAGGTGGCCAAGGAGTGCAGCTGCCAGCGGTGTACGGAAACTCGGAGCATCGTGC 

GGGGCCGTGTCAGTGCTGCTGACAATGGGGAGCCCATGCGCTTTGGCCATGTGTACATGGGG 

AACAGCCGTGTAAGCATGACTGGCTACAAGGGCACTTTCACCCTCCATGTCCCCCAGGACAC 

TGAGAGGCTGGTGCTCACATTTGTGGACAGGCTGCAGAAGTTTGTCAACACCACCAAAGTGC 

TACCTTTCAACAAGAAGGGGAGTGCCGTGTTCCATGAAATCAAGATGCTTCGTCGGAAAGAG 

CCCATCACTTTGGAAGCCATGGAGACCAACATCATCCCCCTGGGGGAAGTGGTTGGTGAAGA 

CCCCATGGCTGAACTGGAGATTCCATCCAGGAGTTTCTACAGGCAGAATGGGGAGCCCTACA 

TAGGAAAAGTGAAGGCCAGTGTGACCTTCCTGGATCCCCGGAATATTTCCACAGCCACAGCT 

GCCCAGACTGACCTGAACTTCATCAATGACGAAGGAGACACTTTCCCCCTTCGGACGTATGG 

CATGTTCTCTGTGGACTTCAGAGATGAGGTCACCTCAGAGCCACTTAATGCTGGCAAAGTGA 

AGGTCCACCTTGACTCGACCCAGGTCAAGATGCCAGAGCACATATCCACAGTGAAACTCTGG 

TCACTCAATCCAGAGACAGGGCTGTGGGAGGAGGAAGGTGATTTCAAATTTGAAAATCAAAG 

GAGGAACAAAAGAGAAGACAGAACCTTCCTGGTGGGCAACCTGGAGATTCGTGAGAGGAGGC 

TCTTTAACCTGGATGTTCCTGAAAGCAGGCGGTGCTTTGTTAAGGTGAGGGCCTACCGGAGT 

GAGAGGTTCTTGCCTAGTGAGCAGATCCAGGGGGTTGTGATCTCCGTGATTAACCTGGAGCC 

TAGAACTGGCTTCTTGTCCAACCCTAGGGCCTGGGGCCGCTTTGACAGTGTCATCACAGGCC 

CCAACGGGGCCTGTGTGCCTGCCTTCTGTGATGACCAGTCCCCTGATGCCTACTCTGCCTAT 

GTCTTGGCAAGCCTGGCTGGGGAGGAACTGCAAGCAGTGGAGTCTTCTCCTAAATTCAACCC 

AAATGCAATTGGCGTCCCTCAGCCCTATCTCAACAAGCTCAACTACCGTCGGACGGACCATG 

AGGATCCACGGGTTAAAAAGACAGCTTTCCAGATTAGCATGGCCAAGCCAAGGCCCAACTCA 

GCTGAGGAGAGCAATGGGCCCATCTATGCCTTTGAGAACCTCCGGGCATGTGAAGAGGCACC 

ACCCAGTGCAGCCCACTTCCGGTTCTACCAGATTGAGGGGGATCGATATGACTACAACACAG 

TCCCCTTCAACGAAGATGACCCTATGAGCTGGACTGAAGACTATCTGGCATGGTGGCCAAAG 

CCGATGGAATTCAGGGCCTGCTATATCAAGGTGAAGATTGTGGGGCCACTGGAAGTGAATGT 

GCGATCCCGCAACATGGGGGGCACTCATCGGCGGACAGTGGGGAAGCTGTATGGAATCCGAG 

ATGTGAGGAGCACTCGGGACAGGGACCAGCCCAATGTCTCAGCTGCCTGTCTGGAGTTCAAG 

TGCAGTGGGATGCTCTATGATCAGGACCGTGTGGACCGCACCCTGGTGAAGGTCATCCCCCA 

GGGCAGCTGCCGTCGAGCCAGTGTGAACCCCATGCTGCATGAGTACCTGGTCAACCACTTGC 

CACTTGCAGTCAACAACGACACCAGTGAGTACACCATGCTGGCACCCTTGGACCCACTGGG^ 

CACAACTATGGCATCTACACTGTCACTGACCAGGACCCTCGCACGGCCAAGGAGATCGCGCT 

CGGCCGGTGCTTTGATGGCACATCCGATGGCTCCTCCAGAATCATGAAGAGCAATGTGGGAG 

TAGCCCTCACCTTCAACTGTGTAGAGAGGCAAGTAGGCCGCCAGAGTGCCTTCCAGTACCTC 

CAAAGCACCCCAGCCCAGTCCCCTGCTGCAGGCACTGTCCAAGGAAGAGTGCCCTCGAGGAG 

G CAGCAG CGAGCGAG CAGGGGTGG CCAGCG CCAGGGTGGAGTGGTGG CCT CTCTGAGATTTC 
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FIGURE 71 B 

CTAGAGTTGCTCAACAGCCCCTGATCAACTAAGTTTTGTGGTACTTCACCCTCTTCTGCCCT 
CATTTCATGTGACAGCCATTGTGAGACTGATGCACAAACTGTCACTTGGTTAATTTAAGCAC 
TTCTGTTTTCGTGAATTTGCTTGTTTGTTTCTTCATGCCTTTACTTACTTTGTCCCATGCTA 
CTGATTGGCACGTGGCCCCCACAATGGCACAATAAAGCCCCTTTGTGAAACTGTTCTTTAAA 
TGAAACACAAGAAATTGGCCACTGGTAAAACTCTGCAGCTTCAACTGTACTTCATTTAATGC 
CATTAATG CAAATATACTTCCT CTT CTTTTTG GATGGTTTTG CCCAC CT CTG CAATAGTGAT 
AATCTGATGCTGAAGATCAAATAACCAATATAAAGCATATTTCTTGGCCTTGCTCCACAGGA 
CATAGGCAAGCCTTGATCATAGTTCATACATATAAATGGTGGTGAAATAAAGAAATAAAACA 
CAATACTTTTACTTGAAATGTAAATAACTTATTTATTTCTTTGCTAAATTTGGAATTCTAGT 
G CACATT CAAAGTT AAG CTATT AAAT AT AGGG TG AT C ATAGT T C CT CT AC C AAGT CTGG AAA 
GAACAT CT CCTGGTAT CC ACAATTACACCAGGTTG CTAACTGTATTTG T ACATTT C CCTTTG 
CATTCGCTTTTGTTCTTGCTAGAAACCCAGTGTAGCCCAGGGCAGATGTCAATAAATGCATA 
CT CTGTATTTCGAAAAAA 
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FIGURE 72 

MVGTKAOTFSFLVLEVTSVLGRQTMLTQSVRRVQPGKKNPSIFAKPADTLESPGEWTTWFNI 
DYPGGKGDYERLDAIRFYYGDRVCARPLRLEARTTDWTPAGSTGQWHGSPREGFWCLNREQ 
RPGQNCSNYTVRFLCPPGSLRRDTERIWSPWSPWSKCSAACGQTGVQTRTRICLAEMVSLCS 
EASEEGQHCMGQDCTACDLTCPMGQVNADCDACMCQDFMLHGAVSLPGGAPASGAAIYLLTK 
TPKLLTQTDSDGRFRI PGLCPDGKS I LKITKVKFAP IVLTMPKTSLKAATI KAEFVRAETP Y 
MVMNPETKARRAGQSVSLCCKATGKPRPDKYFVTYHNDTLLDPSLYKHESKLVLRKLQQHQAG 
EYFCKAQSDAGAVKSKVAQLIVTASDETPCNPVPESYLIRLPHDCFQNATNSFYYDVGRCPV 
KTCAGQQDNGIRCRDAVQNCCGISKTEEREIQCSGYTLPTKVAKECSCQRCTETRSIVRGRV 
SAADNGEPMRFGHVYMGNSRVSMTGYKGTFTLHVPQDTERLVLTFVDRLQKFWTTKVLPFN 
KKGSAVFHEIKMLRRKEPITLEAMETNIIPLGEWGEDPMAELEIPSRSFYRQNGEPYIGKV 
K/^SVTFLDPRNISTATAAQTDI^IITOEGDTFPLRTYGMFSVDFRDEVTSEPIJ^AGKVKVHL 
DSTQVKMPEHISTVKLWSLNPDTGLWEEEGDFKFENQRRNKREDRTFLVGNLEIRERRLFNL 
DVPESRRCFVKVRAYRSERFLPSEQIQGWISVINLEPRTGFLSNPRAWGRFDSVITGPNGA 
CTPAFCDDQSPDAYSAYVLASLAGEELQAVESSPKFNPNAIGVPQPYLNKLNYRRTDHEDPR 
VKKTAFQISMAKPRPNSAEESNGPIYAFENLRACEEAPPSAAHFRFYQIEGDRYDYNTVPFN 
EDDPMSOTEDYLAVmPKPMEFRACYIKVKIVGPLEVNVRSRNMGGTHRRTVGKLYGIRDVRS 
TRDRDQPOTSAACLEFKCSGMLYDQDRVDRTLVKVIPQGSCRRASVNPMLHEYLVNHLPLAV 
NNDTSEYTMLAPLDPLGH^GIYTVTDQDPRTAKEIALGRCFDGTSDGSSRIMKSNVGVALT 
FNCVERQVGRQSAFQYLQSTPAQSPAAGTVQGRVPSRRQQRASRGGQRQGGWASLRFPRVA 
QQPLIN 
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FIGURE 73 
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CTGCAAGTTGTTAACGCCTAACACACAAGTATGTTAGGCTTCCACCAAAGTCCTCAATATAC 

CTGAATACGCACAATATCTTAACTCTTCATATTTGGTTTTGGGATCTGCTTTGAGGTCCCAT 

CTTCATTTAAAAAAAAATACAGAGACCTACCTACCCGTACGCATACATACATATGTGTATAT 

ATATGTAAACT AG ACAAAG ATCG C AG AT CAT AAAG C AAG C TCTG CT TT AG TT T C CAAG AAGA 

TTACAAAGAATTTAGAGATGTATTTGTCAAGATCCCTGTCGATTCATGCCCTTTGGGTTACG 

GTGTCCTCAGTGATGCAGCCCTACCCTTTGGTTTGGGGACATTATGATTTGTGTAAGACTCA 

GATTTACACGGAAGAAGGGAAAGTTTGGGATTACATGGCCTGCCAGCCGGAATCCACGGACA 

TGACAAAATATCTGAAAGTGAAACTCGATCCTCCGGATATTACCTGTGGAGACCCTCCTGAG 

ACGTTCTGTGCAATGGGCAATCCCTACATGTGCAATAATGAGTGTGATGCGAGTACCCCTGA 

GCTGGCACACCCCCCTGAGCTGATGTTTGATTTTGAAGGAAGACATCCCTCCAC7VTTTTGGC 

AGTCTGCCACTTGGAAGGAGTATCCCAAGCCTCTCCAGGTTAACATCACTCTGTCTTGGAGC 

AAAACCATTGAGCTAACAGACAACATAGTTATTACCTTTGAATCTGGGCGTCCAGACCAAAT 

GATCCTGGAGAAGTCTCTCGATTATGGACGAACATGGCAGCCCTATCAGTATTATGCCACAG 

ACTGCTTAGATGCTTTTCACATGGATCCTAAATCCGTGAAGGATTTATCACAGCATACGGTC 

TTAGAAATCATTTGCACAGAAGAGTACTCAACAGGGTATACAACAAATAGCAAAATAATCCA 

CTTTGAAATCAAAGACAGGTTCGCGCTTTTTGCTGGACCTCGCCTACGCAATATGGCTTCCC 

TCTACGGACAGCTGGATACAACCAAGAAACTCAGAGATTTCTTTACAGTCACAGACCTGAGG 

ATAAGGCTGTTAAGACCAGCCGTTGGGGAAATATTTGTAGATGAGCTACACTTGGCACGCTA 

CTTTTACGCGATCTCAGACATAAAGGTGCGAGGAAGGTGCAAGTGTAATCTCCATGCCACTG 

TATGTGTGTATGACAACAGCAAATTGACATGCGAATGTGAGCACAACACTACAGGTCCAGAC 

TGTGGGAAATGCAAGAAGAATTATCAGGGCCGACCTTGGAGTCCAGGCTCCTATCTCCCCAT 

C C CCAAAGGCACTG CAAATA C C TG T AT C C CC AGT ATTT C C AGT ATTGG TACG AATGT CTGCG 

ACAACGAGCT CCTG CACTGCCAGAACGGAGGGACGTGC CACAACAACGTGCGCTGCCTGTGC 

CCGGCCGCATACACGGGCATCCTCTGCGAGAAGCTGCGGTGCGAGGAGGCTGGCAGCTGCGG 

CTCCGACTCTGGCCAGGGCGCGCCCCCGCACGGCACCCCAGCGCTGCTGCTGCTGACCACGC 

TGCTGGGAACCGCCAGCCCCCTGGTGTTCTAGGTGTCACCTCCAGCCACACCGGACGGGCCT 

GTGCCGTGGGGAAGCAGACACAACCCAAACATTTGCTACTAACATAGGAAACACACACATAC 

AGACACCCCCACTCAGACAGTGTACAAACTAAGAAGGCCTAACTGAACTAAGCCATATTTAT 

CACCCGTGGACAGCACATCCGAGTCAAGACTGTTAATTTCTGACTCCAGAGGAGTTGGCAGC 

TG TTG AT ATT AT CA CTG C AAAT CACATTGC CAG C TGCAG AG CATAT TG TGG ATTGG AAAGG C 

TGCGACAGCCCCCCAAACAGGAAAGACAAAAAACAAACAAATCAACCGACCTAAAAACATTG 

GCTACTCTAGCGTGGTGCGCCCTAGTACGACTCCGCCCAGTGTGTGGACCAACCAAATAGCA 

TTCTTTGCTGTCAGGTGCATTGTGGGCATAAGGAAATCTGTTACAAGCTGCCATATTGGCCT 

GCTTCCGTCCCTGAATCCCTTCCAACCTGTGCTTTAGTGAACGTTGCTCTGTAACCCTCGTT 

GGTTGAAAGATTTCTTTGTCTGATGTTAGTGATGCACATGTGTAACAGCCCCCTCTAAAAGC 

G C AAG CCAGTCATAC C CCTGTAT AT CTTAG CAG C ACTG AGTC CAGTG CGAGC AC ACAC CC AC 

TATACAAGAGTGGCTATAGGAAAAAAGAAAGTGTATCTATCCTTTTGTATTCAlAATGAAGTT 

ATTTTTCTTGAACTACTGTAATATGTAGATTTTTTGTATTATTGCCAATTTGTGTTACCAGA 

CAATCTGTTAATGTATCTAATTCGAATCAGCAAAGACTGACATTTTATTTTGTCCTCTTTCG 

TTCTGTTTTGTTTCACTGTGCAGAGATTTCTCTGTAAGGGCAACGAACGTGCTGGCATCAAA 

GAATATCAGTTTACATATATAACAAGTGTAATAAGATTCCACCAAAGGACATTCTAAATGTT 

TTCTTGTTGCTTTAACACTGGAAGATTTAAAGAATAAAAACTCCTGCATAAACGATTTCAGG 

AATTTGTATTGCAATTTCTTAAGATGAAAGGAACAGCCACCAAGCAGTTTCACACTCACTTT 

ACTGATTTCTGTGTGGACTGAGTACATTCAGCTGACGAATTTAGTTCCCAGGAAGATGGATT 

GATGTTCACTAGCTTGGACAACTTCTGCAAAATATGAGACTATTTCCACTTGGGAAAAATTA 

C AA CAG CAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 74 



MYLSRSLSIHALVA/TVSSVMQPYPLWGHYDLCKTQIYTEEGKVWDYM 

VKLDPPDITCGDPPETFCAMGNPYMCmECDASTPEIAHPPELMFDFEGRHPSTFWQSATWK 
EYPKPLQVNITLSWSKTIELTDNIVITFESGRPDQMILEKSLDYGRTWQPYQYYATDCLDAF 
HMDPKSVKDLSQHTVLEIICTEEYSTGYTTNSKI IHFEIKDRFALFAGPRLRNMASLYGQLD 
TTKKLRDFFTVTDLRI RLLRPAVGE I FVDELHLARYFYAI SDI KATRGRCKCNLHATVCVYDN 
SKLTCE CEHNTTGPDCGKCKKNYQGRPWS PGS YLP I PKGTANTCI PS I SS I GTNVCDNELLH 
CQNGGTCHNNVRCLCPAAYTGILCEKLRCEEAGSCGSDSGQGAPPHGTPALLLLTTLLGTAS 



PLVF 
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FTGURE 75 



CCCACGCGTCCGGGTGACCTGGGCCGAGCCCTCCCGGTCGGCTAAGATTGCTGAGGAGGCGG 
CGGGTAGCTGGCAGGCGCCGACTTCCGAAGGCCGCCGTCCGGGCGAGGTGTCCTCATGACTT 
CTCTTGTGGACCATGTCCGTGATCTTTTTTGCCTGCGTGGTACGGGTAAGGGATGGACTGCC 
CCTCTCAGCCTCTACTGATTTTTACCACACCCAAGATTTTTTGGAATGGAGGAGACGGCTCA 
AGAGTTTAGCCTTGCGACTGGCCCAGTATCCAGGTCGAGGTTCTGCAGAAGGTTGTGACTTT 
AGTATACATTTTTCTTCTTTCGGGGACGTGGCCTGCATGGCTATCTGCTCCTGCCAGTGTCC 
AGCAGCCATGGCCTTCTGCTTCCTGGAGACCCTGTGGTGGGAATTCACAGCTTCCTATGACA 
CTACCTGCATTGGCCTAGCCTCCAGGCCATACGCTTTTCTTGAGTTTGACAGCATCATTCAG 
AAAGTGAAGTGGCATTTTAACTATGTAAGTTCCTCTCAGATGGAGTGCAGCTTGGAAAAAAT 
TCAGGAGGAGCTCAAGTTGCAGCCTCCAGCGGTTCTCACTCTGGAGGACACAGATGTGGCAA 
ATGGGGTGATGAATGGTCACACACCGATGCACTTGGAGCCTGCTCCTAATTTCCGAATGGAA 
CCAGTGACAGCCCTGGGTATCCTCTCCCTCATTCTCAACATCATGTGTGCTGCCCTGAATCT 
CATTCGAGGAGTTCACCTTGCAGAACATTCTTTACAGGATCCAAGGAGCTGGTTCTGCTGGT 
TGGACCAAACCTCGTGAGCCAGCCACCCCTGACCCAAATGAGGAGAGCTCTGATTCTCCCAT 
CCGGGAGCAGTGATGTCAAACTTCTGCTGCTGGGGAAATCTCATCAGCAGGGAGCCTGTGGA 
AAAGGGCATGTCAGTGAAATCTGGGAATGGCTGGATTCGGAAACATCTGCCCATGTGTATTG 
ATGGCAGAGCTGTTGCCCACAAGCGCCTTTTATTTAGGGTAAAATTAACAAATCCATTCTAT 
TCCTCTGACCCATGCTTAGTACATATGACCTTTAACCCTTACATTTATATGATTCTGGGGTT- 
GCTTCAGAAGTGTTATTTCATGAATCATTCATATGATTTGATCCCCCAGGATTCTATTTTGT 
TTAATGGG CTTTTCTACT AAAAG CATAAAATA CTGAGG CTGATTTAGTCAGGG CAAAACCAT 
TTACTTTACATATTCGTTTTCAATACTTGCTGTTCATGTTACACAAGCTTCTTACGGTTTTC 
TTGTAACAATAAATATTTTGAGTAAATAATGGGTACATTTTAACAAACTCAGTAGTACAACC 
TAAACTTGTATAAAAGTGTGTAAAAATGTATAGCCATTTATATCCTATGTATAAATTAAATG 
AGGTGGCTTCAGAAATGGCAGAATAAATCTAAAGTGTTTATTAAAAAAAAAAAAAAAAAAAA 
AAAAG 
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FIGURE 76 



msviffacotrvrdglplsastdfyhtqdflewrrrlkslalrlaqypgrgsaegcdfsihf 
ssfgdvacmaicscqcpaamafcfletlwweftasydttciglasrpyaflefdsiiqkvkw 
hfnyvss sqme cslek i qee lklq ppavltledtdvangvmnghtpmhlepapnfrme pvta 
lg i lsl i ln imcaalnl i rgvhlaehs lqdprswfcwldqts 
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FIGURE 77 

TGCTTCCTGGAGACCCTGTGGTGGGAATTCACAGCTTCNTATGACACTACCTGCATTGGCNT 
AGCCTCCAGGCCATACGCTTTTCTTGAGTTTGACAGCATCATTCAGAAAGTGAAGTGGCATT 
TTAACTATGTAAGTTCCTNTCAGATGGAGTGCAGCTTGGAAAAAATTCAGGAGGAGCTCAAG 
TTGCAGCCTCCAGCGGTTCTCANTATGGAGGACACAGATGTGGCAAATGGGGT 
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FIGURE 78 

CTCAGCGGCGCTTCCTCGTAGCGAGCCTAGTGGCGGGTGTTTGCATTGAAACGTGAGCGCGA 
CCCGACCTTAAAGAGTGGGGAGCAAAGGGAGGACAGAGCCCTTTAAAACGAGGCGGGTGGTG 
CCTGCCCCTTTAAGGGCGGGGGGTCCGGACGACTGTATCTGAGCCCCAGACTGCCCCGAGTT 
TCTGTCGCAGGCTGCGAGGAAAGGCCCCTAGGCTGGGTCTGGGTGCTTGGCGGCGGCGGCTT 
CCTCCCCGCTCGTCCTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTA 
TGGAAGCACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCCACGAGAGGATCCGC 
GAGTGTATTATATCAACACTTCTGTTTGCAACACTGTACATCCTCTGCCACATCTTCCTGAC 
CCGCTTCAAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCGTCAACAAGA 
TTGCGCTCGAGCTGTGCACCTTTACCCTGGCAATTGCCCTGGGTGCTGTCCTGCTCCTGCCC 
TTCTCCATCATCAGCAATGAGGTGCTGCTCTCCCTGCCTCGGAACTACTACATCCAGTGGCT 
CAACGGCTCCCTCATCCATGGCCTCTGGAACCTTGTTTTTCTCTTCCCCAACCTGTCCCTCA 
TCTTCCTCATGCCCTTTGCATATTTCTTCACTGAGTCTGAGGGCTTTGCTGGCTCCAGAAAG 
GGTGTCCTGGGCCGGGTCTATGAGACAGTGGTGATGTTGATGCTCCTCACTCTGCTGGTGCT 
AGGTATGGTGTGGGTGGCATCAGCCATTGTGGACAAGAACAAGGCCAACAGAGAGTCACTCT 
ATGACTTTTGGGAGTACTATCTCCCCTACCTCTACTCATGCATCTCCTTCCTTGGGGTTCTG 
CTGCTCCTGGTGTGTACTCCACTGGGTCTCGCCCGCATGTTCTCCGTCACTGGGAAGCTGCT 
AGTCAAGCCCCGGCTGCTGGAAGACCTGGAGGAGCAGCTGTACTGCTCAGCCTTTGAGGAGG 
CAGCCCTGACCCGCAGGATCTGTAATCCTACTTCCTGCTGGCTGCCTTTAGACATGGAGCTG 
CTACACAGACAGGTCCTGGCTCTGCAGACACAGAGGGTCCTGCTGGAGAAGAGGCGGAAGGC 
TTCAGCCTGGCAACGGAACCTGGGCTACCCCCTGGCTATGCTGTGCTTGCTGGTGCTGACGG 
GCCTGTCTGTGCTCATTGTGGCCATCCACATCCTGGAGCTGCTCATCGATGAGGCTGCCATG 
CCCCGAGGCATGCAGGGTACCTCCTTAGGCCAGGTCTCCTTCTCCAAGCTGGGCTCCTTTGG 
TGCCGTCATTCAGGTTGTACTCATCTTTTACCTAATGGTGTCCTCAGTTGTGGGCTTCTATA 
GCTCTCCACTCTTCCGGAGCCTGCGGCCCAGATGGCACGACACTGCCATGACGCAGATAATT 
GGGAACTGTGTCTGTCTCCTGGTCCTAAGCTCAGCACTTCCTGTCTTCTCTCGAACCCTGGG 
GCTCACTCGCTTTGACCTGCTGGGTGACTTTGGACGCTTCAACTGGCTGGGCAATTTCTACA 
TTGTGTTCCTCTACAACGCAGCCTTTGCAGGCCTCACCACACTCTGTCTGGTGAAGACCTTC 
ACTGCAGCTGTGCGGGCAGAGCTGATCCGGGCCTTTGGGCTGGACAGACTGCCGCTGCCCGT 
CTCCGGTTTCCCCCAGGCATCTAGGAAGACCCAGCACCAGTGACCTCCAGCTGGGGGTGGGA 
AGGAAAAAACTGGACACTGCCATCTGCTGCCTAGGCCTGGAGGGAAGCCCAAGGCTACTTGG 
ACCTCAGGACCTGGAATCTGAGAGGGTGGGTGGCAGAGGGGAGCAGAGCCATCTGCACTATT 
GCATT^ATCTGAGCCAGAGTTTGGGACCAGGACCTCCTGCTTTTCCATACTTAACTGTGGCCT 
CAGCATGGGGTAGGGCTGGGTGACTGGGTCTAGCCCCTGATCCCAAATCTGTTTACACATCA 
ATCTGCCTCACTGCTGTTCTGGGCCATCCCCATAGCCATGTTTACATGATTTGATGTGCAAT 
AGGGTGGGGTAGGGGCAGGGAAAGGACTGGGCCAGGGCAGGCTCGGGAGATAGATTGTCTCC 
CTTGCCTCTGGCCCAGCAGAGCCTAAGCACTGTGCTATCCTGGAGGGGCTTTGGACCACCTG 
AAAGACCAAGGGGATAGGGAGGAGGAGGCTTCAGCCATCAGCAATAAAGTTGATCCCAGGGA 
AAAAAA 
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FIGURE 79 

MEAPDYEVLSVREQLFHERIRECIISTLLFATLYILCHIFLTRFKKPAEFTTVDDEDATVNK 
IALELCTFTLAIALGAVLLLPFSI ISNEVLLSLPRNYYIQWLNGSLIHGLWNLVFLFPNLSL 
IFLMPFAYFFTESEGFAGSRKGVLGRVYETVVMLMLLTLLVL^ 

YDFWEYYLPYLYSCISFLGVLLLLVCTPLGLARMFSVTGKLLVKPRLLEDLEEQLYCSAFEE 
AALTRRICNPTSCWLPLDMELLHRQVI^LQTQRVLLEKRRKASAWQRNLGYPIJ^CLLVLT 
GLSVLIVAIHILELLIDEAAMPRGMQGTSLGQVSFSKLGSFGAVIQWLIFYLMVSSWGFY 
SSPLFRSLRPRWHDTAMTQI IGNCVCLLVLSSALPVFSRTLGLTRFDLLGDFGRFNWLGNFY 
IVFLYNAAFAGLTTLCLVKTFTAAVRAELIRAFGLDRLPLPVSGFPQASRKTQHQ 
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FIGURE 80 

GGCTGCCGAGGGAAGGCCCCTTGGGTTGGTCTTGGTTGCTTGGCGGCGGCGGNTTCNTCCCC 
GCTCGTCCTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTATGGAAGC 
ACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCCACGAGAGGATCCGCGAGTGTA 
TTATATCAACACTTCTGTTTGCAACACTGTACATCCTCTGCCACATCTTCCTGACCCGCTTC 
AAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCG 
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FIGURE 81 

GACCGACCTTAAAGAGTGGGAGCAAAGGGAGGACAGAGCCTTTTAAAACGAGGCGGTGGTGC 
CTGCCCTTTAAGGGCGGGGCGTCCGGACGACTGTATCTGAGCCCCAGACTGCCCCGAGTTTC 
TGTCGCAGGCTGCGAGGAAAGGCCCCTAGGCTGGGTCTGGTGCTTGGCGGCGGCGGCTTCCT 
CCCCGTTGTCNTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTATGGA 
AGCACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCCACGAGAGGATCCGCGAGT 
GTATTATATCAACACTTCTGTTTGCAACACTGTACATCNTCTGCCACATCTTCCTGACCCGC 
TT C AAGAAG C CTG C TG AG TT CA C C AC AG TGGATGATGAAG ATG C CACCGTCAAC AAGATTG C 
GCTCGAGCTGTGCACCTTTACCCTGGCAATTGCCCTGGGTGCTGTCCTGCTCCTGCCCTTCT 
CCATCATCAGCAATGAGGTGCTGCACTCCC 
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FIGURE 82 

GATGTGCTCCTTGGAGCTGGTGTGCAGTGTCCTGACTGTAAGATCAAGTCCAAACCTGTTTT 
GGAATTGAGGAAACTTCTCTTTTGATCTCAGCCCTTGGTGGTCCAGGTCTTCATGCTGCTGT 
GGGTGATATTACTGGTCCTGGCTCCTGTCAGTGGACAGTTTGCAAGGACACCCAGGCCCATT 
ATTTTCCTCCAGCCTCCATGGACCACAGTCTTCCAAGGAGAGAGAGTGACCCTCACTTGCAA 
GGGATTTCGCTTCTACTCACCACAGAAAACAAAATGGTACCATCGGTACCTTGGGAAAGAAA 
TACTAAGAGAAACCCCAGACAATATCCTTGAGGTTCAGGAATCTGGAGAGTACAGATGCCAG 
GCCCAGGGCTCCCCTCTCAGTAGCCCTGTGCACTTGGATTTTTCTTCAGAGATGGGATTTCC 
TCATGCTGCCCAGGCTAATGTTGAACTCCTGGGCTCAAGTGATCTGCTCACCTAGGCCTCTC 
AAAGCGCTGGGATTACAGCTTCGCTGATCCTGCAAGCTCCACTTTCTGTGTTTGAAGGAGAC 
VCTGTGGTTCTGAGGTGCCGGGCAAAGGCGGAAGTAACACTGAATAATACTATTTACAAGAA 
TGATAATGTCCTGGCATTCCTTAATAAAAGAACTGACTTCCAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 83 

MLLWVILLVLAPVSGQFARTPRPIIFLQPPWTTVFQGERVTLTCKGFRFYSPQKTKWYHRYL 
GKE ILRETPDN I LEVQESGEYRCQAQGS PLS S PVHLDFS SEMGF PHAAQANVELLGSSDLLT 
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FIGURE 84 

CAGAAGAGGGGGCTAGCTAGCTGTCTCTGCGGACCAGGGAGACCCCCGCGCCCCCCCGGTGT 
GAGGCGGCCTCACAGGGCCGGGTGGGCTGGCGAGCCGACGCGGCGGCGGAGGAGGCTGTGAG 
GAGTGTGTGGAACAGGACCCGGGACAGAGGAACCATGGCTCCGCAGAACCTGAGCACCTTTT 
GCCTGTTGCTGCTATACCTCATCGGGGCGGTGATTGCCGGACGAGATTTCTATAAGATCTTG 
GGGGTGCCTCGAAGTGCCTCTATAAAGGATATTAAAAAGGCCTATAGGAAACTAGCCCTGCA 

G C7TCAT C CCGAC C GG AA CC CTGATGAT C C ACAAG C C C AGG AGAAATT CCAGG AT CTGGG TG 
CTGCTTATGAGGTTCTGTCAGATAGTGAGAAACGGAAACAGTACGATACTTATGGTGAAGAA 
GGATTAAAAGATGGTCATCAGAGCTCCCATGGAGACATTTTTTCACACTTCTTTGGGGATTT 
TGGTTTCATGTTTGGAGGAACCCCTCGTCAGCAAGACAGAAATATTCCAAGAGGAAGTGATA 
T1ATTGTAGATCTAGAAGTCACTTTGGAAGAAGTATATGCAGGAAATTTTGTGGAAGTAGTT 
AGAAACAAACCTGTGGCAAGGCAGGCTCCTGGCAAACGGAAGTGCAATTGTCGGCAAGAGAT 
GCGGACCACCCAGCTGGGCCCTGGGCGCTTCCAAATGACCCAGGAGGTGGTCTGCGACGAAT 
GCCCTAATGTCAAACTAGTGAATGAAGAACGAACGCTGGAAGTAGAAATAGAGCCTGGGGTG 
AGAGACGGCATGGAGTACCCCTTTATTGGAGAAGGTGAGCCTCACGTGGATGGGGAGCCTGG 
AGATTTAC GGTT C CGAAT CAAAGTTGTC AAGC AC CC AAT ATTTGAAAGGAGAGG AG ATG ATT 
TGTACACAAATGTGACAATCTCATTAGTTGAGTCACTGGTTGGCTTTGAGATGGATATTACT 
CACTTGGATGGTCACAAGGTACATATTTCCCGGGATAAGATCACCAGGCCAGGAGCGAAGCT 
ATGGAAGAAAGGGGAAGGGCTCCCCAACTTTGACAACAACAATATCAAGGGCTCTTTGATAA 
TCACTTTTGATGTGGATTTTCCAAAAGAACAGTTAACAGAGGAAGCGAGAGAAGGTATCAAA 
CAGCTACTGAAACAAGGGTCAGTGCAGAAGGTATACAATGGACTGCAAGGATATTS&GAGTG 
AATAAAAT TGGACTTT GT TTAAAATAAGTGAATAAGCGATATTTATTATCTGCAAGGTTTTT 
TTGTGTGTGTTTTTGTTTTTATTTTCAATATGCAAGTTAGGCTTAATTTTTTTATCTAATGA 
TCATCATGAAATGAATAAGAGGGCTTAAGAATTTGTCCATTTGCATTCGGAAAAGAATGACC 
AGCAAAAGGTTTACTAATACCTCTCCCTTTGGGGATTTAATGTCTGGTGCTGCCGCCTGAGT 
TTCAAGAATTAAAGCTGCAAGAGGACTCCAGGAGCAAAAGAAACACAATATAGAGGGTTGGA 
GTTGTTAGCAATTTCATTCAAAATGCCAACTGGAGAAGTCTGTTTTTAAATACATTTTGTTG 
TTATTTTTA 
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FIGURE 85 

MAPQNLSTFCLLLLYL I GAVI AGRDFYK I LGVPRSAS I KD I KKAYRKLALQLHPDRNPDDPQ 
AQEKFQDLGAAYEVLSDSEKRKQYDTYGEEGLKDGHQSSHGDIFSHFFGDFGFMFGGTPRQQ 
DRNI PRGSD 1 1 VDLEVTLEE VYAGNFVEVVRNKPVARQAPGKRKCNCRQEMRTTQLGPGRFQ 
MTQEWCDECPNVKLVNEERTLEVE I EPGVRDGMEYPF I GEGEPHVDGE PGDLRFR I KWKH 
P I FERRGDDLYTNVTI SLVESLVGFEMDITHLDGHKVH I SRDKITRPGAKLWKKGEGLPNFD 
NNNIKGSLIITFDVDFPKEQLTEEAREGIKQLLKQGSVQKVYNGLQGY 
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FIGURE 86 

TG GG ACCAGGGAAC C C CGGGC CC CC CGG TGGAGNG C CTAACAGG CC GG TGGNTG CG AC CG AA 
GCGGCGGGCGGAGGAGGTTTTGAGGATTTTTGGAACAGGACCCGGACAGAGGAACCATGGTT 
CCGCAGAACNTGAGCACNTTTTGCCTGTTGNTGNTATACTTCATCGGGGCGGTGATTGCCGG 
ACGAGATTTNTATAAGATTTTGGGGTGCCTNGAAGTGCCTTNTATAAAGGATATTAAAAAGG 
CCTATAGGAAACTAGCCCTGCAGNTTTATCCCGACCGGAACCCTGATGATCCACAAGCCCAG 
GAGAAATTCCAGGATTTGGGTGCTGCTTATGAGGTTNTGTCAGATAGTGAGAAACGGAAACA 
GTACGATAATTATGGTGAAGAAGGATTAAAAGATGGTNATCAGAGCTCCCATGGAGACATTT 
TTTCACACTTNTTTGGGGATTTTGGTTTCATGTTTGGAGGAACCCCTNGTCAGCAAGACAGA 
AAT ATTC CAAGAG 
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FIGURE 87 

GGCACGAGGCGGCGGGGCAGTCGCGGGATGCGCCCGGGAGCCACAGCCTGAGGCCCTCAGGT 
CTCTGCAGGTGTCGTGGAGGAACCTAGCACCTGCCATCCTCTTCCCCAATTTGCCACTTCCA 
GCAGCTTTAGCCCATGAGGAGGATGTGACCGGGACTGAGTCAGGAGCCCTCTGGAAGCATGG 
AGACTGTGGTGATTGTTGCCATAGGTGTGCTGGCCACCATCTTTCTGGCTTCGTTTGCAGCC 
TTGGTGCTGGTTTGCAGGCAGCGCTACTGCCGGCCGCGAGACCTGCTGCAGCGCTATGATTC 
TAAGCCCATTGTGGACCTCATTGGTGCCATGGAGACCCAGTCTGAGCCCTCTGAGTTAGAAC 
TGGACGATGTCGTTATCACCAACCCCCACATTGAGGCCATTCTGGAGAATGAAGACTGGATC 
GAAGATGCCTCGGGTCTCATGTCCCACTGCATTGCCATCTTGAAGATTTGTCACACTCTGAC 
AGAGAAGCTTGTTGCCATGACAATGGGCTCTGGGGCCAAGATGAAGACTTCAGCCAGTGTCA 
GCGACATCATTGTGGTGGCCAAGCGGATCAGCCCCAGGGTGGATGATGTTGTGAAGTCGATG 
TACCCTCCGTTGGACCCCAAACTCCTGGACGCACGGACGACTGCCCTGCTCCTGTCTGTCAG 
TCACCTGGTGCTGGTGACAAGGAATGCCTGCCATCTGACGGGAGGCCTGGACTGGATTGACC 
AGTCTCTGTCGGCTGCTGAGGAGCATTTGGAAGTCCTTCGAGAAGCAGCCCTAGCTTCTGAG 
CCAGATAAAGGCCTCCCAGGCCCTGAAGGCTTCCTGCAGGAGCAGTCTGCAATTTAGTGCCT 
ACAGGCCAGCAGCTAGCCATGAAGGCCCCTGCCGCCATCCCTGGATGGCTCAGCTTAGCCTT 
CTACTTTTTCCTATAGAGTTAGTTGTTCTCCACGGCTGGAGAGTTCAGCTGTGTGTGCATAG 
TAAAGCAGGAGATCCCCGTCAGTTTATGCCTCTTTTGCAGTTGCAAACTGTGGCTGGTGAGT 
GGCAGTCTAATACTACAGTTAGGGGAGATGCCATTCACTCTCTGCAAGAGGAGTATTGAAAA- 
CTGGTGGACTGTCAGCTTTATTTAGCTCACCTAGTGTTTTCAAGAAAATTGAGCCACCGTCT 
AAGAAATCAAGAGGTTTCACATTAAAATTAGAATTTCTGGCCTCTCTCGATCGGTCAGAATG 
TGTGGCAATTCTGATCTGCATTTTCAGAAGAGGACAATCAATTGAAACTAAGTAGGGGTTTC 
TTCTTTTGGCAAGACTTGTACTCTCTCACCTGGCCTGTTTCATTTATTTGTATTATCTGCCT 
GGTCCCTGAGGCGTCTGGGTCTCTCCTCTCCCTTGCAGGTTTGGGTTTGAAGCTGAGGAACT 
ACAAAGTTGATGATTTCTTTTTTATCTTTATGCCTGCAATTTTACCTAGCTACCACTAGGTG 
GATAGTAAATTTATACTTATGTTTCCCTCAAAAAAAAAAAAAAA 
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FIGURE 88 

METWIVAIGV1ATIFLASFAALVLVCRQRYCRPRDLLQRYDSKPIVDLIGAMETQSEPSEL 
ELDDWI TNPHI EAI LENEDWI EDASGLMSHC I AI LKI CHTLTEKLVAMTMGSGAKMKTS AS 
VSDIIWAKRISPRVDDVVKSMYPPLDPKLLDARTTALLLSVSHLVLVTRNACHLTGGLDWI 
DQSLSAAEEHLEVLREAALASEPDKGLPGPEGFLQEQSAI 
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FIGURE 89 

GCTTCATTTCTCCCGACTCAGCTTCCCACCCTGGGCTTTCCGAGGTGCTTTCGCCGCTGTCC 
CCACCACTGCAGCCATGATCTCCTTAACGGACACGCAGAAAATTGGAATGGGATTAACAGGA 
TTTGGAGTGTTTTTCCTGTTCTTTGGAATGATTCTCTTTTTTGACAAAGCACTACTGGCTAT 
TGGAAATGTTTTATTTGTAGCCGGCTTGGCTTTTGTAATTGGTTTAGAAAGAACATTCAGAT 
TCTTCTTCCAAAAACATAAAATGAAAGCTACAGGTTTTTTTCTGGGTGGTGTATTTGTAGTC 
CTTATTGGTTGGCCTTTGATAGGCATGATCTTCGAAATTTATGGATTTTTTCTCTTGTTCAG 
GGGCTTCTTTCCTGTCGTTGTTGGCTTTATTAGAAGAGTGCCAGTCCTTGGATCCCTCCTAAAT 
TTACCTGGAATTAGATCATTTGTAGATAAAGTTGGAGAAAGCAACAATATGGTATAACAACA 
AGTGAATTTGAAGACTCATTTAAAATATTGTGTTATTTATAAAGTCATTTGAAGAATATTCA 
GCACAAAATTAAATTACATGAAATAGCTTGTAATGTTCTTTACAGGAGTTTAAAACGTATAG 
CCTACAAAGTACCAGCAGCAAATTAGCAAAGAAGCAGTGAAAACAGGCTTCTACTCAAGTGA 
ACTAAGAAGAAGTCAGCAAGCAAACTGAGAGAGGTGAAATCCATGTTAATGATGCTTAAGAA 
ACTCTTGAAGGCTATTTGTGTTGTTTTTCCACAATGTGCGAAAC!TCAGCCATCCTTAGAGAA 
CTGTGGTGCCTGTTTCTTTTCTTTTTATTTTGAAGGCTCAGGAGCATCCATAGGCATTTGCT 
TTTTAGAAGTGTCCACTGCAATGGCAAAAATATTTCCAGTTGCACTGTATCTCTGGAAGTGA 
TGCATGAATTCGATTGGATTGTGTCATTTTAAAGTATTAAAACCAAGGAAACCCCAATTTTG 
^TGTATGGATTACTTTTTTTTGNGCNCAGGGCC 



WO 99/63088 g £ 0 PCT/US99A2252 

FIGURE 90 

MISLTDTQKIGMGLTGFGVFFLFFGMILFFDKALLAIGNVLFVAGLAFVIGLERTFRFFFQK 
HKMKATGFFLGGVFVVLIGWPLIGMIFEIYGFFLLFRGFFPVVVGFIRRVPVLGSLLNLPGI 
RSFVDKVGESNNMV 



Important features: 
Transmembrane domains : 

amino acids 12-30 (typell) , 33-52, 69-89 and 93-109 

N-myristoylation sites. 

5mino acids 11-16, 51-56 and 116-121 



Aminoacyl- transfer RNA synthetases class-II protein. 

amino acids 49-59 
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FIGURE 91 



GAAGACGTGGCGGCTCTCGCCTGGGCTGTTTCCCGGCTTCATTTCTCCCGACTCAGCTTCCC 
ACCNTGGGCTTTCCGAGGTGCTTTCGCCGCTGTCCCCACCACTGCAGCCATGATCTCCTTAA 
CGGACACGCAGAAAATTGGAATGGGATTAACCGGATTTGGAGTGTTTTTCCTGTTCTTTGGA 
ATGATTCTCTTTTTTGACAAAGCACTACTGGCTATTGGAAATGTTTTATTTGTAGCCGGCTT 
GG CTTTTGTAATTGGTTTAGAAAGAACATTCAGATTCTTCTTCCAAAAACATAAAATGAAAG 
CTACAGGTTTTTTTCTGGGTGGTGTATTTGTAGTCCTTATTGGTTGGCCTTTGATAGGCATG 
ATCTTCGAAATTTATGGATTTTTTCTCTTGTTC 
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FIGURE 92 

GGCACGAGGCTGAACCCAGCCGGCTCCATCTCAGCTTCTGGTTTCTAAGTCCATGTGCCAAA 

GGCTGCCAGGAAGGAGACGCCTTCCTGAGTCCTGGATCTTTCTTCCTTCTGGAAATCTTTGA 

CTGTGGGTAGTTATTTATTTCTGAATAAGAGCGTCCACGCATCATGGACCTCGCGGGACTGC 

TGAAGTCTCAGTTCCTGTGCCACCTGGTCTTCTGCTACGTCTTTATTGCCTCAGGGCTAATC 

ATCAACACCATTCAGCTCTTCACTCTCCTCCTCTGGCCCATTAACAAGCAGCTCTTCCGGAA 

GATCAACTGCAGACTGTCCTATTGCATCTCAAGCCAGCTGGTGATGCTGCTGGAGTGGTGGT 

CGGGCACGGAATGCACCATCTTCACGGACCCGCGCGCCTACCTCAAGTATGGGAAGGAAAAT 

GCCATCGTGGTTCTCAACCACAAGTTTGAAATTGACTTTCTGTGTGGCTGGAGCCTGTCCGA 

ACGCTTTGGGCTGTTAGGGGGCTCCAAGGTCCTGGCCAAGAAAGAGCTGGCCTATGTCCCAA 

TTATCGGCTGGATGTGGTACTTCACCGAGATGGTCTTCTGTTCGCGCAAGTGGGAGCAGGAT 

CGCAAGACGGTTGCCACCAGTTTGCAGCACCTCCGGGACTACCCCGAGAAGTATTTTTTCCT 

GATTCACTGTGAGGGCACACGGTTCACGGAGAAGAAGCATGAGATCAGCATGCAGGTGGCCC 

GGGCCAAGGGGCTGCCTCGCCTCAAGCATCACCTGTTGCCACGAACCAAGGGCTTCGCCATC 

ACCGTGAGGAGCTTGAGAAATGTAGTTTCAGCTGTATATGACTGTACACTCAATTTCAGAAA 

TAATGAAAATCCAACACTGCTGGGAGTCCTAAACGGAAAGAAATACCATGCAGATTTGTATG 

TTAGGAGGATCCCACTGGAAGACATCCCTGAAGACGATGACGAGTGCTCGGCCTGGCTGCAC 

AAGCTCTACCAGGAGAAGGATGCCTTTCAGGAGGAGTACTACAGGACGGGCACCTTCCCAGA 

GACGCCCATGGTGCCCCCCCGGCGGCCCTGGACCCTCGTGAACTGGCTGTTTTGGGCCTCGC 

TGGTGCTCTACCCTTTCTTCCAGTTCCTGGTCAGCATGATCAGGAGCGGGTCTTCCCTGACG 

CTGGCCAGCTTCATCCTCGTCTTCTTTGTGGCCTCCGTGGGAGTTCGATGGATGATTGGTGT 

GACGGAAATTGACAAGGGCTCTGCCTACGGCAACTCTGACAGCAAGCAGAAACTGAATGACT 

GACTCAGGGAGGTGTCACCATCCGAAGGGAACCTTGGGGAACTGGTGGCCTCTGCATATCCT 

CCTTAGTGGGACACGGTGACAAAGGCTGGGTGAGCCCCTGCTGGGCACGGCGGAAGTCACGA 

CCTCTCCAGCCAGGGAGTCTGGTCTCAAGGCCGGATGGGGAGGAAGATGTTTTGTAATCTTT 

TTTTCCCCATGTGCTTTAGTGGGCTTTGGTTTTCTTTTTGTGCGAGTGTGTGTGAGAATGGC 

TGTGTGGTGAGTGTGAACTTTGTTCTGTGATCATAGAAAGGGTATTTTAGGCTG CAGGGGAG 

GGCAGGGCTGGGGACCGAAGGGGACAAGTTCCCCTTTCATCCTTTGGTGCTGAGTTTTCTGT 

AACCCTTGGTTGCCAGAGATAAAGTGAAAAGTGCTTTAGGTGAGATGACTAAATTATGCCTC 

CAAGAAAAAAAAATTAAAGTGCTTTTCTGGGTCAAAAAAAAAAAA 
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FIGURE 93 

MDLAGLLKSQFLCHLVFCYVFIASGLI INTIQLFTLLLWPINKQLFRXINCRLSYCISSQLV 
MLLEWWSGTECTIFTDPRAYLKYGKENAIVVLNHKFEIDFLCGWSLSERFGLLGGSKVLAKK 
EIAYVPIIGWMVT^FTEWFCSRKWEQDRKTVATSLQHLRDYPEKYFFLIHCEGTRFTEKKHE 
ISMQVARAKGLPRLKHHLLPRTKGFAITWSLRNWSAVYDCTLNFRNNENPTLLGVLNGKK 
YHADLYVRRIPLEDIPEDDDECSAWLHKLYQEKDAFQEEYYRTGTFPETPMVPPRRPWTLVN 
VOjF^ASLVLYPFFQFLVSMIRSGSSLTLASFILVFFVASVGVRWMIGVTEIDKGSAYGNSDS 
KQKLND 
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FIGURE 94 

CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCG 

GCGCACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAA 

GTAAAAGGTGAAGCCAAGAACAGCATTACTGATTCCCAAATGGATGATGTTGAAGTTGTTTA 

TACAATTGACATTCAGAAATATATTCCATGCTATCAGCTTTTTAGCTTTTATAATTCTTCAG 

GCGAAGTAAATGAGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGT 

TGGTACAAATTCCGTCGTCATTCAGATCAGATCATGACGTTTAGAGAGAGGCTGCTTCACAA 

AAACTTGCAGGAGCATTTTTCAAACCAAGACCTTGTTTTTCTGCTATTAACACCAAGTATAA 

TAACAGAAAGCTGCTCTACTCATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTT 

TTTCACAGGGTACCTTTAGTGGTTGCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAAC 

TGTATCAGGTTCCTGTATGTCCACTGGTTTTAGCCGAGCAGTACAAACACACAGCTCTAAAT 

TTTTTSAAGAAGATGGATCCTTAAAGGAGGTACATAAGATAAATGAAATGTATGCTTCATTA 

CAAGAGGAATTAAAGAGTATATGCAAAAAAGTGGAAGACAGTGAACAAGCAGTAGATAAACT 

AGTAAAGGATGTAAACAGATTAAAACGAGAAATTGAGAAAAGGAGAGGAGCACAGATTCAGG 

CAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGAACATTTTTCTTTGTCAGGCATTA 

CGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTTATGTCTTTAAAAAATAGACA 

TGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGTAGACAATCTGACCTTAA 

TGGTAGAACACACTGACATTCCTGAAGCTAGTCCAGCTAGTACACCACAAATCATTAAGCAT 

AAAGCCTTAGACTTAGATGACAGATGGCAATTCAAGAGATCTCGGTTGTTAGATACACAAGA 

CAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAATGAGCAGCC 

CAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTTTTGGTGAATATTCACGGTCTCCTACA 

TTTTGATCCTTTTAACCTTACAAGGAGATTTTTTTATTTGGCTGATGGGTAAAGCCAAACAT 

TTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCAC 

CTGTTTGCAGTAATACACAGATAACTCTTAGTGCATTTACTTCACAAAGTACTTTTTCAAAC 

ATCAGATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCT 

TACACAGACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAA 

TCCCAGCACTTAGGGAAGACAAGTCAGGAGGATTGATTGAAGCTAGGAGTTAGAGACCAGCC 

TGGGCAACGTATTGAGACCATGTCTATTAAAAAATAAAATGGAAAAGCAAGAATAGCCTTAT 

TTTCAAAATATGGAAAGAAATTTATATGAAAATTTATCTGAGTCATTAAAATTCTCCTTAAG 

TGATACTTTTTTAGAAGTACATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCA 

AT AAATTTG C AAAA CAT CAT C T AAAATT T AAAAAAAAAAAAAAAAAAAAAA 
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FTGIJRE 95 

MEGESTSAVLSGFVLGAIAFQHLNTDSDTEGFLLGEVKGEAK^SITDSQMDDVEVVYTIDIQ 
KYIPCTQLFSFYNSSGEVNEQALKKILSNVKKNWGOTKFRRHSDQIMTFRERLLHKNLQEH 
FSNQDLVFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSC 
MSTGFSRAVQTHSSKFFEEDGSLKEVHKINEMYASLQEELKS I CKKVEDSEQAVDKLVKDVN 
RLKREIEK^GAQIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLKSCVMSLKNRHVSKSS 
CnSTYESTHHLDWD^TLMVEHTDI PEAS PASTPQ 1 1 KHKALDLDDRWQFKRSRLLDTQDKRS KA 
NTGSSNQDKASKMSSPETDEEIEKMKGFGEYSRSPTF 
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FIGURE 96 

GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCA 

GCGCAGGGCAGCCCAAGCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCC 

CCGCGGCGCCTGCCACCCTTCGCTCCTTCCCCGCGTCCCCGCCTCGCCGGCCAGTCAGCTTG 

CCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCTGCTTCCCTGGGCCG 

CGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCT 

GACGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCA 

ACTCCAACTCCTTCTCCCTCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCC 

GCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAAGGTGGGAACGCGTCCGCCCCGGCCCGCA 

CCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTCAGCGCCGCGCTG 

CTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGG 

CTTCAACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCC 

AGGGTT CTAC CTGCTGCT CT CAAGAGATGGAGGAGAAGTACAGCCTG CAAAGTAAAGATGAT 

TTCAAAAGTGTGGTCAGCGAACAGTGCAATCATTTGCAAGCTGTCTTTGCTTCACGTTACAA 

GAAGTTTGATGAATTCTT CAAAGAACTACTTGAAAATG CAGAGAAATC CCTGAATGATATGT 

TTGTGAAG ACATATGG C CATTTATACATGCAAAATTCTGAGCTATTTAAAGATCTCTTCGTA 

GAGTTGAAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTG 

GG CT CG CCTC CTGGAGCGGATGTT CCGC CTGGTGAACTCC CAGT AC CACTTT ACAG ATG AGT 

ATCTGGAATGTGTGAGCAAGTATACGGAGCAGCTGAAGCCCTTCGGAGATGTCCCTCGCAAA 

TTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGCTTAGCGGT 

TGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCC 

TGTTGAAGATGATCTACTGCTCCCACTGCCGGGGTCTCGTGACTGTGT^AGCCATGTTACAAC 

TACTGCTCAAACATCATGAGAGGCTGTTTGGCCAACCAAGGGGATCTCGATTTTGAATGGAA 

CAATTTCATAGATGCTATGCTGATGGTGGCAGAGAGGCTAGAGGGTCCTTTCAACATTGAAT 

CGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGATAATAGT 

GTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACG 

AATTTCTCGTTCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGG 

AACGCCCAACCACAGCAGCTGGCACTAGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAA 

CTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAACGTTTGCAACGATGAGAGGAT 

GGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAGGTACCTGT 

TTGCAGTGACAGGAAATGGATTAGCCAACCAGGGCAACAACCCAGAGGTCCAGGTTGACACC 

AGCAAACCAGACATACTGATCCTTCGTCAAATCATGGCTCTTCGAGTGATGACCAGCAAGAT 

GAAGAATGCATACAATGGGAACGACGTGGACTTCTTTGATATCAGTGATGAAAGTAGTGGAG 

AAGGAAGTGGAAGTGGCTGTGAGTATCAGCAGTGCCCTTCAGAGTTTGACTACAATGCCACT 

GACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCACA 

GGCCTACCTCCTCACTGTCTTCTGCATCTTGTTCCTGGTTATGCAGAGAGAGTGGAGATAAT 

TCTCAAACTCTGAGAAAAAGTGTTCATCAAAAAGTTAAAAGGCACCAGTTATCACTTTTCTA 

CCATCCTAGTGACTTTGCTTTTTAAATGAATGGACAACAATGTACAGTTTTTACTATGTGGC 

CACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGGAGGAAAAGGGACTGTG 

CATTGAGTTGGTTCCTGCTCCCCCAAACCATGTTAAACGTGGCTAACAGTGTAGGTACAGAA 

CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTC 

ATTTCGTTTGTGGGTTTTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAG 

GGTCCCTTCTTGGCACGTAACATGTACGTATTTCTGAAATATTAAATAGCTGTACAGAAGCA 

GGTTTTATTTATCATGTTATCTTATTAAAAGAAAAAGCCCAAAAAGC " 
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FIGURE 97 

^mRFGLPALLCTIAVLSAALLAAELKSKSCSEVRRLyVSKGFNKNDAPLHEINGDHLKICPQ 
GSTCCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMF 
VKTYGHLYMQNSELFKDLFVELKRYYWGNWLEEMLNDFWARLLERMFRLVNSQYHFTDEY 
LE CVS KYTEQLK P FGD VP RKLKLQVTRAFVAART FAQG LAVAGD WS KVS WNPTAQCTHAL 
LKMIYCSHCRGLVTVKPCYOTCSNIMRGCLJ^Q 

VMDPIDVKISDAIMNMQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEE 
RPTTAAGTSLDRLVTDVKEKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLF 
AVTGNGLANQGNN P EVQ VDTS K PD I L I LRQ I MALRVMTS KMKNAYNGNDVDF FDISDESSGE 
GSGSGCEYQQCPSEFDYNATDHAGKSANEKADSAGVRPGAQAYLLTVFCILFLVMQREWR 
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FIGURE 98 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCT 
GACCTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAA 
GCAACTTACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGC 
TGCCACTAATGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCAC 
AGGGACCGAGGCCAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAA 
AGATTGGTTCCTGAGAGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGC 
AGTGCCCCTGTGATCATTTCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGA 
AAGCCAAACAAGCATTCCAGAGCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTT 
TGCTCTGCCTTTGT^GAGCTCTGAGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAA 
GACAGTGAGCACACCTACCAGACACTCTTCTTCTCCCACCTCACTCTCCCACTGTACCCACC 
CCTAAATCATTCCAGTGCTCTCAAAAAGCATGTTTTTCAAGATCATTTTGTTTGTTGCTCTC 
TCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCTGTGCCCTCCCCTTACCCAGGCTTAGGCTT 
AATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCTAGCTAGTGTCATTTAACCTTAAATGC 
AATCAGGAAAGTAGCAAACAGAAGTC^TAAATATTTTTAAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 99 

MKVLISSLLLLLPLMLMSWSSSI^PGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRR 
KFMTVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 
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FIGURE 100 

AATGGCTGTCTTAGTACTTCGCCTGACAGTTGTCCTGGGACTGCTTGTCTTATTCCTGACCT 
GCTATGCAGACGACAAACCAGACAAGCCAGACGACAAGCCAGACGACTCGGGCAAAGACCCA 
AAGCCAGACTTCCCCAAATTCCTAAGCCTCCTGGGCACAGAGATCATTGAGAATGCAGTCGA 
GTTCATCCTCCGCTCCATGTCCAGGAGCACAGGATTTATGGAATTTGATGATAATGAAGGAA 
AACATTCATCAAAGTGACATCCTCAGGACACACCCATGTGGCTCCTGGACAATCCAAGAGCA 
GCCAAATCCTGCTTTTCCAGTTTGGCTCCACAAGTCCTCCAGGACAGAGCCCTCAAAGCAAC 
TCCCAACGAGTTCTCAGGATTCAGGCTCTGGCTTCAACCAAACAGAACTCATTTTGAACACC 
CTGACTGCATTTTTGCTTTTAGAAAGTTAGAATAAATATGGCGCTTTGGGATCACATAGTTG 
ATGGAGAGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 101 



MAVLVLRLTVVLGLLVLFLTCYADDKPDKPDDKPDDSGKDPKPDFPKFLSLLGTEIIENAVE 
F I LRSMSRS TGFME FDDNEGKHS SK 
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FIGURE 102 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCT 

CAGAGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTAC 

CCTGCCCCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCC 

CCTACCTGATGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAG 

CTCTTCAGCCAGATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGG 

CTGCGGAACCGGAGCCAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACC 

CAAATCCCCACTTTGAGAAGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATAT 

GAGCGGTTTGTGGTGGCTCCTGGAGAGGACATGAGACAGCTGGCTGATGGCTC.CATGGATGT 

GGTGGTCTGCACTCTGGTGCTGTGCTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCC 

OGAGAGTACTGAGACCGGGAGGTGTGCTCTTTTTCTGGGAGCATGTGGCAGAACCATATGGA 

AGCTGGGCCTTCATGTGGCAGCAAGTTTTCGAGCCCACCTGGAAACACATTGGGGATGGCTG 

CTGCCTCACCAGAGAGACCTGGAAGGATCTTGAGAACGCCCAGTTCTCCGAAATCCAAATGG 

AACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTGGGCCCCACATCATGGGAAAGGCTGTC 

AAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCCTTCCCCAGCCTCCAATTAGAACA 

AGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTAGCAGAATGAGAGAAGACATT 

CATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGCAATCTCTAACTTCAATC 

CCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGGAAACACTAGGACCC 

TGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTCCCAATGTTGTC 

CCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACACCCATGCGT 

CTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGACCCTCT 

CTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGGAT 

GCCAGAGCAAGACTCAAAGAGGCAGAGGTTTTGTTCTCAAATATTTTTTAATAAATAGACGA 

AACCACG 
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FIGURE 103 

MDILVPLLQLLVLLLTLPUHIJ4ALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQI 
KGLTGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPKFEKFLTKSMAENRHLQYERFW 
APGEDMRQIADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFM 
WQQVFEPTWKHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFP 
SSKALI CS FPSLQLEQATHQP I YLPLRGT 
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FTGURE 104 

GTGGGATTTATTTGAGTGCAAGATCGTTTTCTCAGTGGTGGTGGAAGTTGCCTCATCGCAGG 
CAGATGTTGGGGCTTTGTCCGAACAGCTCCCCTCTGCCAGCTTCTGTAGATAAGGGTTAAAA 
ACTAATATTTATATGACAGAAGAAAAAGATGTCATTCCGTAAAGTAAACATCATCATCTTGG 
TCCTGGCTGTTGCTCTCTTCTTACTGGTTTTGCACCATAACTTCCTCAGCTTGAGCAGTTTG 
TTAAGGAATGAGGTTACAGATTCAGGAATTGTAGGGCCTCAACCTATAGACTTTGTCCCAAA 
TGCTCTCCGACATGCAGTAGATGGGAGACAAGAGGAGATTCCTGTGGTCATCGCTGCATCTG 
AAGACAGGCTTGGGGGGGCCATTGCAGCTATAAACAGCATTCAGCACAACACTCGCTCCAAT 
GTGATTTTCTACATTGTTACTCTCAACAATACAGCAGACCATCTCCGGTCCTGGCTCAACAG 
TGATTCCCTGAAAAGCATCAGATACAAAATTGTCAATTTTGACCCTAAACTTTTGGAAGGAA 
AAGT AAAGG AGG AT CCTG AC C AGG GG GAAT CCATG AAACCTTTAACCTTTG CAAGG TT CTAC 
TTGCCAATTCTGGTTCCCAGCGCAAAGAAGGCCATATACATGGATGATGATGTAATTGTGCA 
AGGTGATATTCTTGCCCTTTACAATACAGCACTGAAGCCAGGACATGCAGCTGCATTTTCAG 
AAGATTGTGATTCAGCCTCTACTAAAGTTGTCATCCGTGGAGCAGGAAACCAGTACAATTAC 
ATTGGCTATCTTGACTATAAAAAGGAAAGAATTCGTAAGCTTTCCATGAAAGCCAGCACTTG 
CTCATTTAATCCTGGAGTTTTTGTTGCAAACCTGACGGAATGGAAACGACAGAATATAACTA 
ACCAACTGGAAAAATGGATGAAACTCAATGTAGAAGAGGGACTGTATAGCAGAACCCTGGCT 
GGTAGCATCACAACACCTCCTCTGCTTATCGTATTTTATCAACAGCACTCTACCATCGATCC 
TATGTGGAATGTCCGCCACCTTGGTTCCAGTGCTGGAAAACGATATTCACCTCAGTTTGTAA 
AGGCTGCCAAGTTACTCCATTGGAATGGACATTTGAAGCCATGGGGAAGGACTGCTTCATAT 
ACTGATGTTTGGGAAAAATGGTATATTCCAGACCCAACAGGCAAATTCAACCTAATCCGAAG 
ATATACCGAGATCTCAAACATAAAGTGAAACAGAATTTGAACTGTAAGCAAGCATTTCTCAG 
GAAGTCCTGGAAGATAGCATGCATGGGAAGTAACAGTTGCTAGGCTTCAATGCCTATCGGTA 
GCAAGCCATGGAAAAAGATGTGTCAGCTAGGTAAAGATGACAAACTGCCCTGTCTGGCAGTC 
AGCTTCCCAGACAGACTATAGACTATAAATATGTCTCCATCTGCCTTACCAAGTGTTTTCTT 
ACTACAATGCTGAATGACTGGAAAGAAGAACTGATATGGCTAGTTCAGCTAGCTGGTACAGA 
TAATTCAAAACTGCTGTTGGTTTTAATTTTGTAACCTGTGGCCTGATCTGTAAATAAAACTT 
ACATTTTTC 
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FIGURE 1 05 



MSFRKWIIILVLAVALFLLVLHHNFLSLSSLLRNEVTDSGIVGPQPIDFVPNALRHAVDGR 
QEEIPWIAASEDRLGGAIAAINSIQHNTRSNVIFYIVTLNNTADHLRSWLNSDSLKSIRYK 
IWFDPKLLEGICVKEDPDQGESMKPLTFARFYLPILVPSAKKAIYMDDDVIVQGDILAL 
ALKPGHAAAFSEDCDSASTKWIRGAGNQYNYIGYLDYKKERIRKLSMKASTCSFNPGVFVA 

nltewkrqnitnqlek™klnveeglysrti^gsittppllivfyqqhstidpmwnvrhlgs 

SAGKRYSPQFVKAAKLLHWNGHLKPWGRTASYTDVWEKWYI PDPTGKFNL I RRYTE I SNI K 
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FIGURE 106 

TGGTTTTTGCCCCATAAATTCCCTCAGCTTGAGCAGTTTGTTAAGGAATGAGGTTACAGATT 
CAGGAATTNTAGGNCCTCAACCTNTAGANTTTGTCCCAAATGTTCTCCGACATGCAGTAGAT 
GGGAGACAAGAGGAGATTCCTGTGGTCATCGCTGCATNTGAAGACAGGCTTGGGGGGGCCAT 
TGCAGCTATAAACAGCATTCAGCACAACACTCGNTCCAATGTGATTTTCTACATTGTTACTC 
TCAACAATACAGCAGACCATNTCCGGTCCTGGNTCAACAGTGATTCCCTGAAAAGCATCAGA 
TACAAAATTGTCAATTTTGACCCTAAACTTTTGGAAGGAAAAGTAAAGGAGGATCCTGACCA 
GGGGGAATCCATGAAACCTTTAACCTTTGCAAGGTTCTACTTGCCAATTCTGGTTCCCAGCG 
CAAAGAAGGCCATATACATGGATGATGATGTAATTGTGCAAGGTGATATTCTTGCCCTTTAC 
AATACAGCACTGAAGCCAGGACATGCAGCTGCATTTTCAGAAGATTGTGATTCAGCCTCTAC 
TAAAGTTGTCATCCGTGGAGCAGGAAA 



WO 99/63088 , A . ^ PCT/US99/12252 

FIGURE 107 

CGACGCTCTAGCGGTTACCGCTGCGGGCTGGCTGGGCGTAGTGGGGCTGCGCGGCTGCCACG 
GAGCTAGAGGGCAAGTGTGCTCGGCCCAGCGTGCAGGGAACGCGGGCGGCCAGACAACGGGC 
TGGGCTCCGGGGCCTGCGGCGCGGGCGCTGAGCTGGCAGGGCGGGTCGGGGCGCGGGCTGCA 
TCCGCATCTCCTCCATCGCCTGCAGTAAGGGCGGCCGCGGCGAGCCTTTGAGGGGAACGACT 
TGTCGGAGCCCTAACCAGGGGTGTCTCTGAGCCTGGTGGGATCCCCGGAGCGTCACATCACT 
TTCCGATCACTTCAAAGTGGTTAAAAACTAATATTTATATGACAGAAGAAAAAGATGTCATT 
CCGTAAAGTAAACATCATCATCTTGGTCCTGGGCTGTTGCTCTCTTCTTACTGGTTTTGCAC 
CATAACTTCCTCAGCTTGAGGCAGTTTGTTAAGGAATGAGGTTACAGATTCAGGAATTGTAG 
GGCCTCAACCTATAGGACTTTGTCCCAAATGCTCTCCGACATGCAGTAGATGGGAGACAAGA 
GGAGATTCCTGTGGTCATCGCTGCATCTGAAGACAGGCTTGGGGGGGCCATTGCAGCTATAA 
ACAGCATTCAGCACAACACTCGCTCCAATGTGATTTTCTACATTGTTACTCTCAACAATACA 
GCAGACCATCTCCGGTCCTGGGCTCAACAGTGATTCCCTGAAAAGCATCAGATACAAAATTG 
TCAATTTTGACCCTAAACTTTTGGAAGGAAAAGTAAAGGAGGATCCTGACCAGGGGGAATCC 
ATGAAACCTTTAACCTTTGCAAGGTTCTACTTGCCAATTCTGGGTTCCCAGCGCAAAGAAGG 
CCATATACATGGATGATGATGTAATTGTGCAAGGTGATATTCTTGCCCTTTACAATACAGCA 
CTGAAGCCAGGACATGCAGCTGCATTTTCAGAAGATTGTGATTCAGCCTCTACTAAAGTTGT 
CATCCGTGGAGCAGGAAACCAGTACAATTACATTGGCTATCTTGACTATAAAAAGGAAAGAA 
TTCGTAAGCTTTCCATGAAAGCCAGCACTTGCTCATTTAATCCTGGAGTTTTTGTTGCAAAC 
CTGACGGAATGGAAACGACAGAATATAACTAACCAACTGGAAAAATGGATGAAACTCAATGT 
AGAAGAGGGACTGTATAGCAGAACCCTGGCTGGTAGCATCACAACACCTCCTCTGCTTATCG 
TATTTTATCAACAGCACTCTACCATCGATCCTATGTGGAATGTCCGCCACCTTGGTTCCAGT 
GCTGGAAAACGATATTCACCTCAGTTTGTAAAGGCTGCCAAGTTACTCCATTGGAATGGACA 
TTTGAAGCCATGGGGAAGGACTGCTTCATATACTGATGTTTGGGGAAAAATGGTATATTCCA 
GACCCAACAGGCAAATTCAACCTAATCCGAAGATATACCGAGATCTCAAACATAAAGTGAAA 
CAGAATTTGAACTGTAAGCAAGCATTTCTCAGGAAGTCCTGGAAGATAGCATGCGTGGGAAG 
TAACAGTTGCTAGGCTTCAATGCCTATCGGTAGCAAGCCATGGAAAAAGATGTGTCAGCTAG 
GTAAAGATGACAAACTGCCCTGTCTGGCAGTCAGCTTCCCAGACAGACTATAGACTATAAAT 
ATGTCTCCATCTGCCTTACCAAGTGTTTTCTTACTACAATGCTGAATGACTGGAAAGAAGAA 
CTGATATGGCTAGTTCAGCTAGCTGGTACAGATAATTCAAAACTGCTGTTGGTTTTAATTTT 
GTAACCTGTGGCCTGATCTGTAAATAAAACTTACATTTTTCAATAGGTAAAAAAAAAAAAAA 
AAAAAA 
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FIGURE 108 

CTGCAGGTAGACATCTCCACTGCCCAGGAATCACTGAGCGTGCAGACAGCACAGCCTCCTCT 
GAAGGCCGGCCATACCAGAGTCCTGCCTCGGCATGGGCCTCACCATTGAGGCAGCTCCACTG 
TCTGTGCTGGTCTGAGGGTGCTGCCTGTCATGGGGGCAGCCATCTCCCAGGGGGCCCTCATC 
GCCATCGTCTGCAACGGTCTCGTGGGCTTCTTGCTGCTGCTGCTCTGGGTCATCCTCTGCTG 
GGCCTGCCATTCTCGTCTGCCGACGTTGACTCTCTCTCTGAATCCAGTCCCAACTCCAGCCC 
TGGCCCCTGTCCTGAGAAGGCCCCACCACCCCAGAAGCCCAGCCATGAAGGCAGCTACCTGC 
TGCAGCCCTGAAGGCCCCTGGCCTAGCCTGGAGCCCAGGACCTAAGTCCACCTCACCTAGAG 
CCTGGAATTAGGATCCCAGAGTTCAGCCAGCCTGGGGTCCAGAACTCAAGAGTCCGCCTGCT 
TGGAGCTGGACCCAGCGGCCCAGAGTCTAGCCAGCTTGGCTCCAATAGGAGCTCAGTGGCCC 
TAAGGAGATGGGCCTGGGGTGGGGGCTTATGAGTTGGTGCTAGAGCCAGGGCCATCTGGACT 
ATGCTCCATCCCAAGGGCCAAGGGTCAGGGGCCGGGTCCACTCTTTCCCTAGGCTGAGCACC 
TCTAGGCCCTCTAGGTTGGGGAAGCAAACTGGAACCCATGGCAATAATAGGAGGGTGTCCAG 
GCTGGGCCCCTCCCCTGGTCCTCCCAGTGTTTGCTGGATAATAAATGGAACTATGGCTCTAA 
AAAAAAAAAAAAAAAAA 
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FIGURE 109 

MGAAISQGALIAIVCNGLVGFLLLLLWILCWACHSRLPTLTLSLNPVPTPALAPVX.RRPHH 
PRSPAMKAATCCSPEGPWPSLEPRT 
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FIGURE 110 

GTTTGAATTCCTTCAACTATACCCAC^GTCC^AAGCAGACTCACTGTGTCCCAGGCTACCA 
GTTCCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTA 
CTCCCTATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAA 
TCATGTCGGGAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCC 
ATGATGTTTACCTTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTAT 
TTTGGGATTGTTGTTTGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACC 
TCAGCATAGAATTGGACACAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTA 
TCCACAGGCATCACGGCAGTGCTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATT 
GACAGTTGAGCTTTTCCAAATCACAAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCC 
AGCCACTGTGGACATTTGCCATCCTCATTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTG 
AGCCTGGGAACTGCAGGAGCTGCCCAGGTTATGGAAGGCGGCCAAGTGGAATATAAGCCCCT 
TTCGGGCATTCGGTACATGTGGTCGTACCATTTAATTGGCCTCATCTGGACTAGTGAATTCA 
TCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAGTGGTTACTTGTTATTTCAACAGAAGT 
AAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTCTCCATTCTCTTCTTCTACCATCA 
AGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAGGATTCCGAGAATCATTGTCA 
TGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCATTGTCCAGGTACCTGTTCCGA 
TGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTCAACCAGAATGCATA 
TACTACAACTGCTATTAATGGGACAGATTTCTGTACATCAGCAAAAGATGCATTCAAAATCT 
TGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAATTTTTCTA 
GGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTACAATCG 
GGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGCCC 
ATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGAT 
CTGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTT 
CGTAAAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGA 
ATGAGGAGGGAACAGAACTCCAGGCCATTGTGAGATAGATACCCATTTAGGTATCTGTACCT 
GGAAAACATTTCCTTCTAAGAGCCATTTACAGAATAGAAGATGAGACCACTAGAGAAAAGTT 
AGTGAATTTTTTTTTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FIGURE 111 

MSGRDTILGLCILALALSLAMMFTFRFITTLL 

S IELDTERENMKCVLGFAI VSTG I TAVLLVL I FVLRKR I KLTVELFQ I TNKAI S SAPFLLFQ 

PLWTFAILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFI 
lACQQOTIAGAWTCn^TRSKNDPP 

YMQNALKEQQHGALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKIL 

SKNSSHFTSINCFGDFIIFLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAH 

SFLSVFETVLDALFLCFAVDLETNDGSSEKPYFMDQEFLSFVKRSNKI^ARAQQDKHSLRN 
EEGTELQAIVR 
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FIGURE 112 

GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTT 

AGAATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTG 

TGTGGTGAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATT 

TATGAGGACTGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGG 

TGACTGGAGTACATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTG 

CCTCAGATCAACTGCGATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATG 

TCCAGCAGGATGCCAAGACCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACT 

CCAGTGTGTGTGGCGCTGCCGTACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTT 

GTTCGGAAGGTTGCTGGACAGTCTGGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTT 

ATCCCTACCACGATGGAGAGAATCCTTTATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAA 

CCTACCCATCAGCTCTTACATACTCATCATCGAAAAGTCCAGCTGCCCAAGCAGGTGAGACC 

ACAAAAGCCTATCAGAGGCCACCTATTCCAGGGACAACTGCACAGCCGGTCACTCTGATGCA 

GCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCACCACCTTGCCAAGGCCATCCCCTTCTG 

CTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGGGCCACAGGAGCCAGGAGATGGAT 

CTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCCAGAGCTGATCCAGGTATCCA 

AAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGCGGATGTCAGCCTGGGAC 

TTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCCTGGGAGATCCAAAC 

TGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAACGGCGATTCCG 

AATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGCCGGTCCAC 

TGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACACACACG 

AATTCTCGAGATCTGAAGACAGCCATAGAGAAAATTACTCAGAGAGGAGGACTTTCTAATGT 

AGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCG 

GGGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCT 

TCAAGACTTGCGAGAGAGTCAGGAATCAACATTTTCTTCATCACCATTGAAGGTGCTGCTGA 

AAATGAGAAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACG 

GCTTCTACTCGCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTG 

AAGCGGGTCTGCGACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACAT 

TGGCTTCGTCATCGACGGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGT 

TTGTGACCAACCTCACCAAAGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTG 

CAGTACACCTACGAACAGCGGCTGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACAT 

CCTCAACGCCATCAAGAGGGTGGGCTACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCA 

ACTTCGCCCTGGAGCAGCTCTTCAAGAAGTCCAAGCCCAACAAGAGGAAGTTAATGATCCTC 

ATCACCGACGGGAGGTCCTACGACGACGTCCGGATCCCAGCCATGGCTGCCCATCTGAAGGG 

AGTGATCACCTATGCGATAGGCGTTGCCTGGGCTGCCCAAGAGGAGCTAGAAGTCATTGCCA 

CTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGAGTTTGACAACCTCCATCAGTATGTC 

CCCAGGATCATCCAGAACATTTGTACAGAGTTCAACTCACAGCCTCGGAACTCAATTCAGAG 

CAGGCAGAGCACCAGCAAGTGCTGCTTTACTAACTGACGTGTTGGACCACCCCACCGCTTAA 

TGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAACAAATGTCTTGTTATTA 

TTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGATGATCACAAACGT 

ATAGAATGAGCCAAAAGGCTACATCATGTTGAGGGTGCTGGAGATTTTACATTTTGACAATT 

GTTTTCAAAATAAATGTTCGGAATACAGTGCAGCCCTTACGACAGGCTTACGTAGAGCTTTT 

GTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCATTTTT 

GTCATGACAATGTAGGAATTGCTGAATTAAATGTTTAGAAGGATGAAAAATAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 113 

MRTVVLTMICASVI EMFLVLLVTGVHSNKETAKKI KR PKFTVPQ INCDVKAGKI I DPEF I VKC 
PAGCQDPKyHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSL 
SLPRVmESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQ 
LLAVTVAVATPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQ 
RQDPSGAAFQKPVGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFR 
IQKQLLADVAQALDIGPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNV 
GRAISFVTKNFFSKANGTOSGAPNVVVVMTO^ 

NEKQYWEPNFANKAVCRTNGFYSLJfVQSV^GLHKTLQPLVKRVCDTDRl^ 

GF V I DG S S S VGTGNFRTVLQ FVTNLTKEFE I SDTDTR I GAVQ YT YEQRLE FGFDKYSS KPD I 

LN AI KRVG YW S GGT S TGAAINFALEQLFKKSKPNKRKLM I L I TDGR S YDDVR I P AMAAHLKG 

VITYAIGVAWAAQEELEVIATHPARDHSFFVDEFDNLHQYVPRIIQNICTEFNSQPRN 
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FIGURE U4 

CAGGATGAACTGGTTGCAGTGGCTGCTGCTGCTGCGGGGGCGCTGAGAGGACACGAGCTCTA 
TGCCTTTCCGGCTGCTCATCCCGCTCGGCCTCCTGTGCGCGCTGCTGCCTCAGCACCATGGT 
GCGCCAGGTCCCGACGGCTCCGCGCCAGATCCCGCCCACTACAGTTTTTCTCTGACTCTAAT 
TGATGCACTGGACACCTTGCTGATTTTGGGGAATGTCTCAGAATTCCAAAGAGTGGTTGAAG 
TGCTCCAGGACAGCGTGGACTTTGATATTGATGTGAACGCCTCTGTGTTTGAAACAAACATT 
CGAGTGGTAGGAGGACTCCTGTCTGCTCATCTGCTCTCCAAGAAGGCTGGGGTGGAAGTAGA 
GGCTGGATGGCCCTGTTCCGGGCCTCTCCTGAGAATGGCTGAGGAGGCGGCCCGAAAACTCC 
TCC CAG C CTTTCAG AC C C CC ACTGGCATGC CATATG GAAC AG TGAACTTACTTCATGGCGTG 
AACCCAGGAGAGACCCCTGTCACCTGTACGGCAGGGATTGGGACCTTCATTGTTGAATTTGC 
CACCCTGAGCAGCCTCACTGGTGACCCGGTGTTCGAAGATGTGGCCAGAGTGGCTTTGATGC 
GCCTCTGGGAGAGCCGGTCAGATATCGGGCTGGTCGGCAACCACATTGATGTGCTCACTGGC 
AAGTGGGTGGCCCAGGACGCAGGCATCGGGGCTGGCGTGGACTCCTACTTTGAGTACTTGGT 
GAAAGGAGCCATCCTGCTTCAGGATAAGAAGCTCATGGCCATGTTCCTAGAGTATAACAAAG 
CCATCCGGAACTACACCCGCTTCGATGACTGGTACCTGTGGGTTCAGATGTACAAGGGGACT 
GTGTCCATGCCAGTCTTCCAGTCCTTGGAGGCCTACTGGCCTGGTCTTCAGAGCCTCATTGG 
AGAC ATTG A CAAT G C CATG AGGAC CTT C CT C AA C T A CT AC AC TG T ATGG AAG CAGTTTGGGG 
GGCTCCCGGAATTCTACAACATTCCTCAGGGATACACAGTGGAGAAGCGAGAGGGCTACCCA 
CTTCGGCCAGAACTTATTGAAAGCGCAATGTACCTCTACCGTGCCACGGGGGATCCCACCCT 
CCTAGAACTCGGAAGAGATGCTGTGGAATCCATTGAAAAAATCAGCAAGGTGGAGTGCGGAT 
TTGCAACAATCAAAGATCTGCGAGACCACAAGCTGGACAACCGCATGGAGTCGTTCTTCCTG 
GCCGAGACTGTGAAATACCTCTACCTCCTGTTTGACCCAACCAACTTCATCCACAACAATGG 
GTCCACCTTCGACGCGGTGATCACCCCCTATGGGGAGTGCATCCTGGGGGCTGGGGGGTACA 
TCTTCAACACAGAAGCTCACCCCATCGACCTTGCCGCCCTGCACTGCTGCCAGAGGCTGAAG 
GAAGAGCAGTGGGAGGTGGAGGACTTGATGAGGGAATTCTACTCTCTCAAACGGAGCAGGTC 
GAAATTTCAGAAAAACACTGTTAGTTCGGGGCCATGGGAACCTCCAGCAAGGCCAGGAACAC 
TCTTCTCACCAGAAAACCATGACCAGGCAAGGGAGAGGAAGCCTGCCAAACAGAAGGTCCCA 
CTTCTCAGCTGCCCCAGTCAGCCCTTCACCTCCAAGTTGGCATTACTGGGACAGGTTTTCCT 
AGACTCCTC ATAA CCACTGGATAATTTTTTTATTTTTATTTTTTTGAGGCTAAACTATAATA 
AATTGCTTTTGGCTATCATAAAA 
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FIGURE 115 

MPFRLLIPLGLLCALLPQHHGAPGPDGSAPDPAHYSFSLTLIDALDTLLILGNVSEFQRWE 
VLQDSVDFDIDVNASVFETNIRWGGLLSAHLLSKK^^ 

LPAFQTPTGMPYGTVNLLHGVNPGETPVTCTAGIGTFIVEFATLSSLTGDPVFEDVARVALM 
RLWESRSDIGLVGNHIDVLTGKVTVAQDAGIGAGVDSYFEYLVKGAILLQDKKIJ^AMFLEYNK 
AIRNYTRFDDWYLOTQWYKGTVSMPVFQSLEAYW 

GLPEFYNI PQGYTVEKREGYPLRPEL I E S AMYLYRATGDPTLLELGRDAVES IEKIS KVE CG 
FAT I KDLRDHKLDNRMES FFLAETVKYL YLLFD P TNF I HNNG S TFDAVI TP YGE CI LGAGGY 
I FNTEAHP I DLAALHCCQRLKEEQWE VEDLMREFYSLKRS RSKFQKNTVSSGPWEPPARPGT 
LFSPENHDQARERKPAKQKVPLLSCPSQPFTSKLALLGQVFLDSS 



WO 99/O088 / PCT7US99/I2252 

FIGTJRE 116 

AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACTCCCTGCTGATGCAACATCTGGGTTTG 
GGCAGAAAGGAGGGTGCTTCGGAGCCCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAGAACA 
ATTCAGGCTTCGCTGCGACTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAGATGGCT 
GAGATGGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGGTCAAACTGAGTCTACCA 
AATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATGTGGTTTTTCT 
ACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCCCCTCAGAACCTC 
TCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCAGTGATCGCGCCTGGAGA 
AACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTGTACACGAGCCACATCT 
GGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTGATGTCACTGATGACATC 
ACGGCCACTGTGCCATACAACCTTCGTGTCAGGGCCACATTGGGCTCACAGACCTCAGCCTG 
GAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTACCCGACCTGGGATGGAGA 
TCACCAAAGATGGCTTCCACCTGGTTATTGAGCTGGAGGACCTGGGGCCCCAGTTTGAGTTC 
CTTGTGGCCTACTGGAGGAGGGAGCCTGGTGCCGAGGAACATGtCAAAATGGTGAGGAGTGG 
GGGTATTCCAGTGCACCTAGAAACCATGGAGCCAGGGGCTGCATACTGTGTGAAGGCCCAGA 
CATTCGTGAAGGCCATTGGGAGGTACAGCGCCTTCAGCCAGACAGAATGTGTGGAGGTGCAA 
GGAGAGGCCATTCCCCTGGTACTGGCCCTGTTTGCCTTTGTTGGCTTCATGCTGATCCTTGT 
GGTCGTGCCACTGTTCGTCTGGAAAATGGGCCGGCTGCTCCAGTACTCCTGTTGCCCCGTGG 
TGGTCCTCCCAGACACCTTGAAAATAACCAATTCACCCCAGAAGTTAATCAGCTGCAGAAGG 
GAGGAGGTGGATGCCTGTGCCACGGCTGTGATGTCTCCTGAGGAACTCCTCAGGGCCTGGAT 
CTCATAGGTTTGCGGAAGGGCCCAGGTGAAGCCGAGAACCTGGTCTGCATGACATGGAAACC 
ATGAGGGGACAAGTTGTGTTTCTGTTTTCCGCCACGGACAAGGGATGAGAGAAGTAGGAAGA 
GCCTGTTGTCTACAAGTCTAGAAGCAACCATCAGAGGCAGGGTGGTTTGTCTAACAGAACAC 
TGACTGAGGCTTAGGGGATGTGACCTCTAGACTGGGGGCTGCCACTTGCTGGCTGAGCAACC 
CTGGGAAAAGTGACTTCATCCCTTCGGTCCTAAGTTTTCTCATCTGTAATGGGGGAATTACC 
TACACACCTGCTAAACACACACACACAGAGTCTCTCTCTATATATACACACGTACACATAAA 
TACACCCAGCACTTGCAAGGCTAGAGGGAAACTGGTGACACTCTACAGTCTGACTGATTCAG 
TGTTTCTGGAGAGCAGGACATAAATGTATGATGAGAATGATCAAGGACTCTACACACTGGGT 
GGCTTGGAGAGCCCACTTTCCCAGAATAATCCTTGAGAGAAAAGGAATCATGGGAGCAATGG 
TGTTGAGTTCACTTCAAGCCCAATGCCGGTGCAGAGGGGAATGGCTTAGCGAGCTCTACAGT 
AGGTGACCTGGAGGAAGGTCACAGCCACACTGAAAATGGGATGTGCATGAACACGGAGGATC 
CATGAACTACTGTAAAGTGTTGACAGTGTGTGCACACTGCAGACAGCAGGTGAAATGTATGT 
GTGCAATGCGACGAGAATGCAGAAGTCAGTAACATGTGCATGTTTGTTGTGCTCCTTTTTTC 
TGTTGGTAAAGTACAGAATTCAGCAAATAAAAAGGGCCACCCTGGCCAAAAGCGGTAAAAAA 
AAAAAAAAAA 



WO 99/63088 , PCT7US99/12252 

FIGURE 117 

MQTFTMVLEEIWTSLFMWFFYALIPCLLTDEVAILPAPQNLSVLSTNMKHLLMWSPVIAPGE 
TVYYSVEYOGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITATVPYNLRVRATLGSQTSAW 
SILKHPFmNSTILTRPGMEITKDGFHLVIELEDLGPQFEFLVAYWRREPGAEEHVKMVRSG 
GI PVHLE TME PGAAYC VKAQTFVKA I GR YS AFS QTE CVEVQGEA I P LVLALFAFVGFML I LV 
WPLFVV™GRLLQYSCCPVVVLPDTLKITNSPQ 

Important features : 
Signal peptide: 

amino acids 1-29 

Transmembrane domain: 

amino acids 230-255 

N-glycosylation sites. 

amino acids 40-43 and 134-137 

Tissue factor proteins homology. 

amino acids 92-119 

Integrins alpha chain protein homology. 

amino acids 232-262 
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FIGURE 11S 
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TCCTGCTGATGCACATCTGGGTTTGGCAAAAGGAGGTTGCTTCGAGCCGCCCTTTCTAGCTT 
CCTGGCCGGCTCTAGAACAATTCAGGCTTCGCTGCGACTAGACCTCAGCTCCMCATATGCA 
TTCTGAAGAAAGATGGCTGAGATGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGG 
TCAAACTGAGTCTACCAAATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCT 
TTTCATGTGGTTTTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC 
CTGCCCCTCAGAACCTCTCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCA 
GTGATCGCGCCTGGAGAAACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCT 
GTACACGAGCCACATCTGGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTG 
ATGTCACTGATGACATCACGGCCACTGTGCCATACAACCTTTGTGTCAGGGCCACATTGGGC 
TCACAGACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTAC 
CCGACCTGGGATGGAGATCACCAAAGATGGCTTNCACCTGGTTATTGAGCTGGAGGACCTGG 
GGCCCCAGTTTGAGTTCCTTGTGGCCTANTGGAGGAGGGGCGAACCCCTTGCGGCGCAAGGG 
GTTNGCGAACCCCTTGCGGCCGCTGGGGTATCTCTCGAGAAAAGAGAGGCCCAATATGACCCAC 
ATACTCAATATGGACGAANTGCTATTGTCCACCTGTTTGAGTGGCGCTGGGTTGAT 
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FIGURE 119 

CGGACGCGTGGGCCGCCACCTCCGGAACAAGCCATGGTGGCGGCGACGGTGGCAGCGGCGTG 

GCTGCTCCTGTGGGCTGCGGCCTGCGCGCAGCAGGAGCAGGACTTCTACGACTTCAAGGCGG 

TCAACATCCGGGGCAAACTGGTGTCGCTGGAGAAGTACCGCGGATCGGTGTCCCTGGTGGTG 

AATGTGGCCAGCGAGTGCGGCTTCACAGACCAGCACTACCGAGCCCTGCAGCAGCTGCAGCG 

AGACCTGGGCCCCCACCACTTTAACGTGCTCGCCTTCCCCTGCAACCAGTTTGGCCAACAGG 

AGCCTGACAGCAACAAGGAGATTGAGAGCTTTGCCCGCCGCACCTACAGTGTCTCATTCCCC 

ATGTTTAGCAAGATTGCAGTCACCGGTACTGGTGCCCATCCTGCCTTCAAGTACCTGGCCCA 

GACTTCTGGGAAGGAGCCCACCTGGAACTTCTGGAAGTACCTAGTAGCCCCAGATGGAAAGG 

TGGTAGGGGCTTGGGACCCAACTGTGTCAGTGGAGGAGGTCAGACCCCAGATCACAGCGCTC 

GTGAGGAAGCTCATCCTACTGAAGCGAGAAGACTTATAACCACCGCGTCTCCTCCTCCACCA 

CCTCATCCCGCCCACCTGTGTGGGGCTGACCAATGCAAACTCAAATGGTGCTTCAAAGGGAG 

AGACCCACTGACTCTCCTTCCTTTACTCTTATGCCATTGGTCCCATCATTCTTGTGGGGGAA 

AAATTCTAGTATTTTGATTATTTGAATCTTACAGCAACAAATAGGAACTCCTGGCCAATGAG 

AGCTCTTGACCAGTGAATCACCAGCCGATACGAACGTCTTGCCAACAAAAATGTGTGGCAAA 

TAGAAGTATATCAAGCAATAATCTCCCACCCAAGGCTTCTGTAAACTGGGACCAATGATTAC 

CTCATAGGGCTGTTGTGAGGATTAGGATGAAATACCTGTGAAAGTGCCTAGGCAGTGCCAGC 

CAAATAGGAGGCATTCAATGAACATTTTTTGCATATAAACCAAAAAATAACTTGTTATCAAT 

AAAAACTTGCATCCAACATGAATTTCCAGCCGATGATAATCCAGGCCAAAGGTTTAGTTGTT 

GTTATTTCCTCTGTATTATTTTCTTCATTACAAAAGAAATGCAAGTTCATTGTAACAATCCA 

AACAATACCTCACGATATAAAATAAAAATGAAAGTATCCTCCTCAAAAA 
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FIGURE 120 

MVAATVAAAWLLLWAAACAQQEQDFYDFKAVN I RGKLVS LEKYRGS VS LWNVASE CG FTDQ 
HYRALQQLQRDLGPHHFNVIAFPCNQFGQQEPDSNKEIESFARRTYSVSFPMFSKIAVTGTG 
AHPAFKYI^QTSGK£PTWNFWKTLVAPDGK^ 
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FIGURE 121 

CGGACGCGTGGGCGGGCCGGGACGCAGGGCAAAGCGAGCCMGGCTGTCTACGTCGGGATGC 
TGCGCCTGGGGAGGCTGTGCGCCGGGAGCTCGGGGGTGCTGGGGGCCCGGGCCGCCCTCTCT 
CGGAGTTGGCAGGAAGCCAGGTTGCAGGGTGTCCGCTTCCTCAGTTCCAGAGAGGTGGATCG 
CATGGTCTCCACGCCCATCGGAGGCCTCAGCTACGTTCAGGGGTGCACCAAAAAGCATCTTA 
ACAGCAAGACTGTGGGCCAGTGCCTGGAGACCACAGCACAGAGGGTCCCAGAACGAGAGGCC 
TTGGTCGTCCTCCATGAAGACGTCAGGTTGACCTTTGCCCAACTCAAGGAGGAGGTGGACAA 
AGCTGCTTCTGGCCTCCTGAGCATTGGCCTCTGCAAAGGTGACCGGCTGGGCATGTGGGGAC 
CTAACTCCTATGCATGGGTGCTCATGCAGTTGGCCACCGCCCAGGCGGGCATCATTCTGGTG 
TCTGTGAACCCAGCCTACCAGGCTATGGAACTGGAGTATGTCCTCAAGAAGGTGGGCTGCAA 

GG CC CTTG TGTT C C CCAAGCAATT CAAGAC CCAG CAATAC TACAACGTCCTGAAG CAGATCT 
GTCCAGAAGTGGAGAATGCCCAGCCAGGGGCCTTGAAGAGTCAGAGGCTCCCAGATCTGACC 
ACAGTCATCTCGGTGGATGCCCCTTTGCCGGGGACCCTGCTCCTGGATGAAGTGGTGGCGGC 
TGGCAGCACACGGCAGCATCTGGACCAGCTCCAATACAACCAGCAGTTCCTGTCCTGCCATG 
ACCCCATCAACATCCAGTTCACCTCGGGGACAACAGGCAGCCCCAAGGGGGCCACCCTCTCC 
CACTACAACATTGTCAACAACTCCAACATTTTAGGAGAGCGCCTGAAACTGCATGAGAAGAC 
ACCAGAGCAGTTGCGGATGATCCTGCCCAACCCCCTGTACCATTGCCTGGGTTCCGTGGCAG 
GCACAATGATGTGTCTGATGTACGGTGCCACCCTCATCCTGGCCTCTCCCATCTTCAATGGC 
AAGAAGGCACTGGAGGCCATCAGCAGAGAGAGAGGCACCTTCCTGTATGGTACCCCCACGAT 
GTTCGTGGACATTCTGAACCAGCCAGACTTCTCCAGTTATGACATCTCGACCATGTGTGGAG 
GTGTCATTGCTGGGTCCCCTGCACCTCCAGAGTTGATCCGAGCCATCATCAACAAGATAAAT 
ATGAAGGACCTGGTGGTTGCTTATGGAACCACAGAGAACAGTCCCGTGACATTCGCGCACTT 
CCCTGAGGACACTGTGGAGCAGAAGGCAGAAAGCGTGGGCAGAATTATGCCTCACACGGAGG 
CCCGGATCATGAACATGGAGGCAGGGACGCTGGCAAAGCTGAACACGCCCGGGGAGCTGTGC 
ATCCGAGGGTACTGCGTCATGCTGGGCTACTGGGGTGAGCCTCAGAAGACAGAGGAAGCAGT 
GGATCAGGACAAGTGGTATTGGACAGGAGATGTCGCCACAATGAATGAGCAGGGCTTCTGCA 
AGATCGTGGGCCGCTCTAAGGATATGATCATCCGGGGTGGTGAGAACATCTACCCCGCAGAG 
CTCGAGGACTTCTTTCACACACACCCGAAGGTGCAGGAAGTGCAGGTGGTGGGAGTGAAGGA 
CGATCGGATGGGGGAAGAGATTTGTGCCTGCATTCGGCTGAAGGACGGGGAGGAGACCACGG 
TGGAGGAGATAAAAGCTTTGTGCAAAGGGAAGATCTCTCACTTCAAGATTCCGAAGTACATC 
GTGTTTGTCACAAACTACCCCCTCACCATTTCAGGAAAGATCCAGAAATTCAAACTTCGAGA 
GC AGATGGAA CGAC AT CT AAAT CTGTGAATAAAG CAGCAGG CCTGTCCTGG C CGGTTGGCTT 
GACT CT CT CCTGT C AGAATG CAACCTGGCTTTATGC ACCTAGATGTCCCC AGCACC CAGTTC 
TGAGCCAGGCACATCAAATGTCAAGGAATTGACTGAACGAACTAAGAGCTCCTGGATGGGTC 
CGGGAACTCGCCTGGGCACAAGGTGCCAAAAGGCAGGCAGCCTGCCCAGGCCCTCCCTCCTG 
TCCATCCCCCACATTCCCCTGTCTGTCCTTGTGATTTGGCATAAAGAGCTTCTGTTTTCTTT 
GAAAAAAAAAAAAAAAA 
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FIGURE 122 



MAVyVGMLRLGRLCAGSSGVLGARAALSRSWQEARLQGVRFLSSREVDRJVTV'STPrGGLSYVQ 
GCTKKHLNSKTVGQCLETTAQRVPEREALVVLHEDVRLTFAQLKEEVDKAASGLLSIGLCKG 
DRLGMWGPNSYAWVLMQLATAQAG 1 1 LVSVNPAYQAMELEYVLKKVGCKALVFPKQFKTQQY 
YNVLKQICPEVENAQPGALKSQRLPDLTTVISVDAPLPGTLLLDEWAAGSTRQHLDQLQYN 
QQFLSCHDPINIQFTSGTTGSPKGATLSHYNIVNNSNILGERLKLHEKTPEQLRMILPNPLY 
HCLGSVAGTMMCLMYGATLI LAS P I FNGKKALEA I SRERGTFLYGTPTMFVD ILNQPDFSS Y 
DISTMCGGVIAGSPAPPELIRAIINKINMKDLWAYGTTENSPVTFAHFPEDTVEQKAESVG 
RIMPHTEARIMNMEAGTLAKLNTPGELCIRGYCV^ 

MNEQGFCKIVGRSKDMIIRGGENIYPAELEDFFHTHPKVQEVQWGVKDDRMGEEICACIRL 
KDGEETTVEE I KAFCKGK I SHFKI PKYI VFVTNYPLT I SGKIQKFKLREQMERHLNL 
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FIGURE 123 

CAACTCCAACATTTTAGGAGAGCGCCTGAAACTGCATGAGAAGACACCAGAGCAGTTGCGGA 
TGATCCTGCCCAACCCCCTGTACCATTGCCTGGGTTCCGTGGCAGGCACAATGATGTGTCTG 
ATGTACGGTGCCACCCTCATCCTGGCCTCTCCCATCTTCAATGGCAAGAAGGCACTGGAGGC 
CATCAGCAGAGAGAGAGGCACCTTCCTGTATGGTACCCCCACGATGTTCGTGGACATTCTGA 
ACCAGCCAGACTTCTCCAGTTATGACATCTCGACCATGTGTGGAGGTGTCATTGCTGGGTCC 
CCTGCACCTC CAGAGT TG AT C CGAG C CAT CAT CAAC AAG AT AAAT ATG AAGG ACCTG GTG G T 
TGCTTATGGAACCACAGAGAACAGTCCCGTGACATTCGCGCACTTCCCTGAGGACACTGTGG 
AGCAGAAGGCAGAAAGCGTGGGCAGAATTATGCCTCACACGGAGGCGCGGATCATGAACATG 
GAGGCAGGGACGCTGGCAAAGCTGAACACGCCCGGGGAGCTGTGCATCCGAGGGTACTGCGT 

CATGCTGGGCTACTGGGGTGAGCCTCAGAAGACAGAGGAAGCAGTGGATCAGGACAAGTGGT 
ATTGGACAGGAGATGTCGCCAC 
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FIGURE 124 

GAGCAGGACGGAGCCATGGACCCCGCCAGGAAAGCAGGTGCCCAGGCCATGATCTGGACTGC 
AGGCTGGCTGCTGCTGCTGCTGCTTCGCGGAGGAGCGCAGGCCCTGGAGTGCTACAGCTGCG 
TGCAGAAAGCAGATGACGGATGCTCCCCGAACAAGATGAAGACAGTGAAGTGCGCGCCGGGC 
GTGGACGTCTGCACCGAGGCCGTGGGGGCGGTGGAGACCATCCACGGACAATTCTCGCTGGC 
AGTGCGGGGTTGCGGTTCGGGACTCCCCGGCAAGAATGACCGCGGCCTGGATCTTCACGGGC 
TTCTGGCGTTCATCCAGCTGCAGCAATGCGCTCAGGATCGCTGCAACGCCAAGCTCAACCTC 
ACCTCGCGGGCGCTCGACCCGGCAGGTAATGAGAGTGCATACCCGCCCAACGGCGTGGAGTG 
CTACAGCTGTGTGGGCCTGAGCCGGGAGGCGTGCCAGGGTACATCGCCGCCGGTCGTGAGCT 
GCTACAACGCCAGCGATCATGTCTACAAGGGCTGCTTCGACGGCAACGTCACCTTGACGGCA 
GCTAATGTGACTGTGTCCTTGCCTGTCCGGGGCTGTGTCCAGGATGAATTCTGCACTCGGGA 
TGGAGTAACAGGCCCAGGGTTCACGCTCAGTGGCTCCTGTTGCCAGGGGTCCCGCTGTAACT 
CTGACCTCCGCAACAAGACCTACTTCTCCCCTCGAATCCCACCCCTTGTCCGGCTGCCCCCT 
CCAGAGCCCACGACTGTGGCCTCAACCACATCTGTCACCACTTCTACCTCGGCCCCAGTGAG 
ACCCACATCCACCACCAAACCCATGCCAGCGCCAACCAGTCAGACTCCGAGACAGGGAGTAG 
AACACGAGGCCTCCCGGGATGAGGAGCCCAGGTTGACTGGAGGCGCCGCTGGCCACCAGGAC 
CGCAGCAATTCAGGGCAGTATCCTGCAAAAGGGGGGCCCCAGCAGCCCCATAATAAAGGCTG 
TGTGGCTCCCACAGCTGGATTGGCAGCCCTTCTGTTGGCCGTGGCTGCTGGTGTCCTACTGT 
GAGCTTCTCCACCTGGAAATTTCCCTCTCACCTACTTCTCTGGCCCTGGGTACCCCTCTTCT 
CATCACTTCCTGTTCCCACCACTGGACTGGGCTGGCCCAGCCCCTGTTTTTCCAACATTCCC 
CAGTATCCCCAGCTTCTGCTGCGCTGGTTTGCGGCTTTGGGAAATAAAATACCGTTGTATAT 
ATTCTGCCAGGGGTGTTCTAGCTTTTTGAGGACAGCTCCTGTATCCTTCTCATCCTTGTCTC 
TCCGCTTGTCCTCTTGTGATGTTAGGACAGAGTGAGAGAAGTCAGCTGTCACGGGGAAGGTG 
AGAGAGAGGATGCTAAGCTTCCTACTCACTTTCTCCTAGCCAGCCTGGACTTTGGAGCGTGG 
GGTGGGTGGGACAATGGCTCCCCACTCTAAGCACTGCCTCCCCTACTCCCCGCATCTTTGGG 
GAATCGGTTCCCCATATGTCTTCCTTACTAGACTGTGAGCTCCTCGAGGGGGGGCCCGGTAC 
CCAATTCGCCCTATAGTGAGTCGTA 
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FIGURE 125 

MDPARKAGAQAMIWTAGWLLLLLLRGGAQALECYSCVQKADDGCSPNKMKTVKCAPGVDVCT 
EAVGAVETIHGQFSLAVRGCGSGLPGKNDRGLDLHGLLAFIQLQQCAQDRCNAKLKLTSRAL 
DPAGNESAYPPNGVECYSOTGLSREACQGTSPPWSCYNASDHVYKGCFDGNVTLTAANVTV 
SLPVRGCVQDEFCTRDGVTGPGFTLSGSCCQGSRCNSDLRNKTYFSPRIPPLVRLPPPEPTT 
VASTTSVTTSTSAPVRPTSTTKPMPAPTSQTPRQGVEHEASRDEEPRLTGGAAGHQDRSNSG 
QYPAKGG PQQ PHNKGCVAPTAGLAALLLAVAAG VLL 
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FIGURE 126 

CGGGACTCGGCGGGTCCTCCTGGGAGTCTCGGAGGGGACCGGCTGTGCAGACGCCATGGAGT 
TGGTGCTGGTCTTCCTCTGCAGCCTGCTGGCCCCCATGGTCCTGGCCAGTGCAGCTGAAAAG 
GAGAAGGAAATGGACCCTTTTCATTATGATTACCAGACCCTGAGGATTGGGGGACTGGTGTT 
CGCTGTGGTCCTCTTCTCGGTTGGGATCCTCCTTATCCTAAGTCGCAGGTGCAAGTGCAGTT 
TCAATCAGAAGCCCCGGGCCCCAGGAGATGAGGAAGCCCAGGTGGAGAACCTCATCACCGCC 
AATGCAACAGAGCCCCAGAAGCAGAGAACTGAAGTGCAGCCATCAGGTGGAAGCCTCTGGAA 
CCTGAGGCGGCTGCTTGAACCTTTGGATGCAAATGTCGATGCTTAAGAAAACCGGCCACTTC 
AGCAACAGCCCTTTCCCCAGGAGAAGCCAAGAACTTGTGTGTCCCCCACCCTATCCCCTCTA 
ACACCATTCCTCCACCTGATGATGCAACTAACACTTGCCTCCCCACTGCAGCCTGCGGTCCT 
GCCCACCTCCCGTGATGTGTGTGTGTGTGTGTGTGTGTGACTGTGTGTGTTTGCTAACTGTG 
GTCTTTGTGGCTACTTGTTTGTGGATGGTATTGTGTTTGTTAGTGAACTGTGGACTCGCTTT 
CCCAGGCAGGGGCTGAGCCACATGGCCATCTGCTCCTCCCTGCCCCCGTGGCCCTCCATCAC 
CTTCTGCTCCTAGGAGGCTGCTTGTTGCCCGAGACCAGCCCCCTCCCCTGATTTAGGGATGC 
GTAGGGTAAGAGCACGGGCAGTGGTCTTCAGTCGTCTTGGGACCTGGGAAGGTTTGCAGCAC 
TTTGTCATCATTCTTCATGGACTCCTTTCACTCCTTTAACAAAAACCTTGCTTCCTTATCCC 
AC CT G AT C C C AGT CT G AAGGTCT CTT AG CAACTGGAG ATACAAAGC AAGG AGCTGGTG AGCC 
CAGCGTTGACGTCAGGCAGGCTATGCCCTTCCGTGGTTAATTTCTTCCCAGGGGCTTCCACG 
AGGAGTCCCCATCTGCCCCGCCCCTTCACAGAGCGCCCGGGGATTCCAGGCCCAGGGCTTCT 
ACTCTGCCCCTGGGGAATGTGTCCCCTGCATATCTTCTCAGCAATAACTCCATGGGCTCTGG 
GACCCTACCCCTTCCAACCTTCCCTGCTTCTGAGACTTCAATCTACAGCCCAGCTCATCCAG 
ATGCAGACTACAGTCCCTGCAATTGGGTCTCTGGCAGGCAATAGTTGAAGGACTCCTGTTCC 
GTTGGGGCCAGCACACCGGGATGGATGGAGGGAGAGCAGAGGCCTTTGCTTCTCTGCCTACG 
TCCCCTTAGATGGGCAGCAGAGGCAACTCCCGCATCCTTTGCTCTGCCTGTCGGTGGTCAGA 
GCGGTGAGCGAGGTGGGTTGGAGACTCAGCAGGCTCCGTGCAGCCCTTGGGAACAGTGAGAG 
GTTG AAGGT C AT AACG AG AG TGGG AACT C AAC C C AG AT C C CG C C C CTC CTGTCCTCTGTGTT 
CCCGCGGAAACCAACCAAACCGTGCGCTGTGACCCATTGCTGTTCTCTGTATCGTGATCTAT 
CCTCAACAACAACAGAAAAAAGGAATAAAATATCCTTTGTTTCCT 
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FIGURE 127 

MELVLVFLCSLl^PIWLASAAEKEKEMDPFHYDYQTLRIGGLVFAVVLFSVGILLILSRRCK 
CSFNQKPRAPGDEEAQVENLITANATEPQKQRTEVQPSGGSLWNLRRLLEPLDANVDA 
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FIGURE 128 

AAACTTGACGCCATGAAGATCCCGGTCCTTCCTGCCGTGGTGCTCCTCTCCCTCCTGGTGCT 
CCACTCTGCCCAGGGAGCCACCCTGGGTGGTCCTGAGGAAGAAAGCACCATTGAGAATTATG 
CGTCACGACCCGAGGCCTTTAACACCCCGTTCCTGAACATCGACAAATTGCGATCTGCGTTT 
AAGGCTGATGAGTTCCTGAACTGGCACGCCCTCTTTGAGTCTATCAAAAGGAAACTTCCTTT 
CCTCAACTGGGATGCCTTTCCTAAGCTGAAAGGACTGAGGAGCGCAACTCCTGATGCCCAGT 
GACCATGACCTCCACTGGAAGAGGGGGCTAGCGTGAGCGCTGATTCTCAACCTACCATAACT 
CTTTCCTGCCTCAGGAACTCCAATAAAACATTTTCCATCCAAA 
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FIGURE 129 

MKIPVLPAVVLLSLLVLHSAQGATLGGPEEESTIENYASRPEAFNTPFLNIDKLRSAFKADE 
FLNWHALFES I KRKLPFLNWDAFPKLKGLRSATPDAQ 
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FIGURE 110 

CAGTT CTG AAAT CAATGGAGTTAATTTAGGGAATACAAAC C AG C CATGGGGG TG GAGATTG C 
CTTTGCCTCAGTGATTCTCACCTGCCTCTCCCTTCTGGCAGCAGGAGTCTCCCAGGTTGTTC 
TTCTCCAGCCAGTTCCAACTCAGGAGACAGGTCCCAAGGCCATGGGAGATCTCTCCTGTGGC 
TTTGCCGGCCACTCAT^GAGTGTTTTTGTGTAAAGTATTTTTTAGAATACTGTTGACTTCT 
TCATGATTTAATAACCATCCTTTGCGAAGTTTTATGAGGCTTTAGGGGAATGTCAACCCTCA 
AATTTTTGTTATACTAGATGGCTTCCATTTACCCACCACTATTTTAAGGTCCCTTTATTTTT 
AGGTTCAAGGTTCATTTGACTTGAGAAAGTGCCCTTCTGCAGCTTCATTGATTTTGTTTATC 
TTCACTATTAATTGTAACGATTAAAAAAGAATAAGAGCACGCAGACCTCTAGGAGAATATTT 
TATCCCTGGGTGCCCCTGACACATTTATGTAGTGATCCCACAAATGTGATTGTTAATTTAAA 
TGTTATTCTAATATTAGTACATTCAGTTGTGATGTAATATGAATAACCAGAATCTATTTCTT 
AAAAGTTTTGAGTATATTTTTCAACTAGATATTTGTATAGAAAGACTGAATAGTGATG 
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FIGURE 131 

MGVEIAFASVILTCLSLLAAGVSQWLLQPVPTQETGPKAMGDLSCGFAGHS 
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FIGURE 132 

GGGGAATCTGCAGTAGGTCTGCCGGCGATGGAGTGGTGGGCTAGCTCGCCGCTTCGGCTCTG 
GCTGCTGTTGTTCCTCCTGCC.CTCAGCGCAGGGCCGCCAGAAGGAGTCAGGTTCAAAATGGA 
AAGTATTTATTGACCAAATTAACAGGTCTTTGGAGAATTACGAACCATGTTCAAGTCAAAAC 

TG CAG CTG CTAC CATGGTGT C ATAG AAG AGG AT C T AACTC CTTT CCGAGG AG G CAT CT C C AG 
GAAGATGATGGCAGAGGTAGTCAGACGGAAGCTAGGGACCCACTATCAGATCACTAAGAACA 
GACTGTACCGGGAAAATGACTGCATGTTCCCCTCAAGGTGTAGTGGTGTTGAGCACTTTATT 
TTGGAAGTGATCGGGCGTCTCCCTGACATGGAGATGGTGATCAATGTACGAGATTATCCTCA 
GGTTCCTAAATGGATGGAGCCTGCCATCCCAGTCTTCTCCTTCAGTAAGACATCAGAGTACC 
ATGATATCATGTATCCTGCTTGGACATTTTGGGAAGGGGGACCTGCTGTTTGGCCAATTTAT 
CCTACAGGTCTTGGACGGTGGGACCTCTTCAGAGAAGATCTGGTAAGGTCAGCAGCACAGTG 
GC CATGGAAAAAGAAAAACT CTACAG CATATTTCCGAGG ATCAAGGAC AAGT C CAGAACGAG 
ATCCTCTCATTCTTCTGTCTCGGAAAAACCCAAAACTTGTTGATGCAGAATACACCAAAAAC 
CAGGCCTGGAAATCTATGAAAGATACCTTAGGAAAGCCAGCTGCTAAGGATGTCCATCTTGT 
GGATCACTGCAAATACAAGTATCTGTTTAATTTTCGAGGCGTAGCTGCAAGTTTCCGGTTTA 
AACACCTCTTCCTGTGTGGCTCACTTGTTTTCCATGTTGGTGATGAGTGGCTAGAATTCTTC 
TATCCACAGCTGAAGCCATGGGTTCACTATATCCCAGTCAAAACAGATCTCTCCAATGTCCA 
AGAGCTGTTACAATTTGTAAAAGCAAATGATGATGTAGCTCAAGAGATTGCTGAAAGGGGAA 
GCCAGTTTATTAGGAACCATTTGCAGATGGATGACATCACCTGTTACTGGGAGAACCTCTTG 
AGTGAATACTCTAAATTCCTGTCTTATAATGTAACGAGAAGGAAAGGTTATGATCAAATTAT 
TCCCAAAATGTTGAAAACTGAACTATAGTAGTCATCATAGGACCATAGTCCTCTTTGTGGCA 
ACAGATCTCAGATATCCTACGGTGAGAAGCTTACCATAAGCTTGGCTCCTATACCTTGAATA 
TCTGCTATCAAGCCAAATACCTGGTTTTCCTTATCATGCTGCACCCAGAGCAACTCTTGAGA 
AAGATTTAAAATGTGTCTAATACACTGATATGAAGCAGTTCAACTTTTTGGATGAATAAGGA 
CCAGAAATCGTGAGATGTGGATTTTGAACCCAACTCTACCTTTCATTTTCTTAAGACCAATC 
ACAGCTTGTGCCTCAGATCATCCACCTGTGTGAGTCCATCACTGTGAAATTGACTGTGTCCA 
TGTGATGATGCCCTTTGTCCCATTATTTGGAGCAGAAAATTCGTCATTTGGAAGTAGTACAA 
CTCATTGCTGGAATTGTGAAATTATTCAAGGCGTGATCTCTGTCACTTTATTTTAATGTAGG 
AAACCCTATGGGGTTTATGAAAAATACTTGGGGATCATTCTCTGAATGGTCTAAGGAAGCGG 
TAGCCATGCCATGCAATGATGTAGGAGTTCTCTTTTGTAAAACCATAAACTCTGTTACTCAG 
GAGGTTTCTATAATGCCACATAGAAAGAGGCCAATTGCATGAGTAATTATTGCAATTGGATT 
TCAGGTTCCCTTTTTGTGCCTTCATGCCCTACTTCTTAATGCCTCTCTAAAGCCAAA 
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FIGURE 133 

MEWWASSPLRLWLLLFLLPSAQGRQKESGSKWKVFIDQINRSLEKYEPCSSQNCSCYHGVIE 
EDLTPFRGGISRKMMAEWRRKLGTHYQITKNRLYRENDCMFPSRCSGVEHFILEVIGRLPD 
MEMVINVRDYPQVPKWMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDL 
FREDLTOSAAQWPWKKiCNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKSMKDT 
LGKPAAKDVHLVDHCKYKYLFNFRGVAASFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVH 
YIPVKTDLSNVQELLQFVKANDDVAQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 
NVTRRKGYDQ 1 1 PKMLKTEL 
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FIGURE 134 

CACCCCTCCATTTCTCGCCATGGCCCCTGCACTGCTCCTGATCCCTGCTGCCCTCGCCTCTT 

TCATCCTGGCCTTTGGCACCGGAGTGGAGTTCGTGCGCTTTACCTCCCTTCGGCCACTTCTT 

GGAGGGATCCCGGAGTCTGGTGGTCCGGATGCCCGCCAGGGATGGCTGGCTGCCCTGCAGGA 

CCGCAGCATCCTTGCCCCCCTGGCATGGGATCTGGGGCTCCTGCTTCTATTTGTTGGGCAGC 

ACAGCCTCATGGCAGCTGAAAGAGTGAAGGCATGGACATCCCGGTACTTTGGGGTCCTTCAG 

AGGTCACTGTATGTGGCCTGCACTGCCCTGGCCTTGCAGCTGGTGATGCGGTACTGGGAGCC 

CATACCCAAAGGCCCTGTGTTGTGGGAGGCTCGGGCTGAGCCATGGGCCACCTGGGTGCCGC 

TCCTCTGCTTTGTGCTCCATGTCATCTCCTGGCTCCTCATCTTTAGCATCCTTCTCGTCTTT 

GACTATGCTGAGCTCATGGGCCTCAAACAGGTATACTACCATGTGCTGGGGCTGGGCGAGCC 

TCTGGCCCTGAAGTCTCCCCGGGCTCTCAGACTCTTCTCCCACCTGCGCCACCCAGTGTGTG 

TGGAGCTGCTGACAGTGCTGTGGGTGGTGCCTACCCTGGGCACGGACCGTCTCCTCCTTGCT 

TTCCTCCTTACCCTCTACCTGGGCCTGGCTCACGGGCTTGATCAGCAAGACCTCCGCTACCT 

CCGGGCCCAGCTACAAAGAAAACTCCACCTGCTCTCTCGGCCCCAGGATGGGGAGGCAGAGT 

GAGGAGCTCACTCTGGTTACAAGCCCTGTTCTTCCTCTCCCACTGAATTCTAAATCCTTAAC 

ATCCAGGCCCTGGCTGCTTCATGCCAGAGGCCCAAATCCATGGACTGAAGGAGATGCCCCTT 

CTACTACTTGAGACTTTATTCTCTGGGTCCAGCTCCATACCCTAAATTCTGAGTTTCAGCCA 

CTGAACTCCAAGGTCCACTTCTCACCAGCAAGGAAGAGTGGGGTATGGAAGTCATCTGTCCC 

TTCACTGTTTAGAGCATGACACTCTCCCCCTCAACAGCCTCCTGAGAAGGAAAGGATCTGCC 

CTGACCACTCCCCTGGCACTGTTACTTGCCTCTGCGCCTCAGGGGTCCCCTTCTGCACCGCT 

GGCTTCCACTCCAAGAAGGTGGACCAGGGTCTGCAAGTTCAACGGTCATAGCTGTCCCTCCA 

GGCCCCAACCTTGCCTCACCACTCCCGGCCCTAGTCTCTGCACCTCCTTAGGCCCTGCCTCT 

GGGCTCAGACCCCAACCTAGTCAAGGGGATTCTCCTGCTCTTAACTCGATGACTTGGGGCTC 

CCTGCTCTCCCGAGGAAGATGCTCTGCAGGAAAATAAAAGTCAGCCTTTTTCTAAAAAAAA 
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FIGURE 135 

MAPALLLIPAALASFIIAFGTGTOFTOFTSLRPLLGGIPESGGPDARQGWLAALQDRSILAP 
IAWDLGLLLLFVGQHSLMAAERVKAWTSRYFGVLQRSLYVACTALALQLVMRYWEPIPKGPV 
LWEARAEPWATWVPLLCFVLHV I S WLLI FS I LLVFD YAELMGLKQVYYHVLGLGE PLALKS P 
RALRLFSHLRHPVCVELLTVLWWPTLGTDRLLLAFLLTLYLGLAHGLDQQDLRYLRAQLQR 
KLHLLS RPQDGEAE 
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FTGTJRE 136 

CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGA 
AGAAATTGCCAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGT 
TTTAATTAAGCATGGAATACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATT 
CCACAGTTTTCTTAGCTCCCTGGACCCGGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTA 
TCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTAAAGAACCTTCGGCTCGCGTGCTTCTG 
AGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTAGGATGTCACTGAGATCC 
CTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTACCTCAGCCT 
TCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTT 
ACAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATGAAAATCCATTT 
CTGGTCATTCTGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTAC 
TTGGGGTGAAAAAAAGTCTTGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAG 
AGGCTGAAAAGGAAGACAAAATGTTGGCATTGTCCTTAGAGGATGAACACCTTCTTTATGGT 
GACATAATCCGACAAGATTTTTTAGACACATATAATAACCTGACCTTGAAAACCATTATGGC 
ATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATGAAGACAGACACTGATG 
TTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGAGAAGTTT 

TTCACAGGTT AT C C TCTAATTG ATAATTATTCCTATAG AGGATTTTAC CAAAAAAC CCATAT 
TTCTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAA 
TGTCCAGAGATTTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTT 
GAAGATGTTTATGTCGGGATCTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGA 
CACAAATCTTTTCTTTCTATATAGAATCCATTTGGATGTCTGTCAACTGAGACGTGTGATTG 
CAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTTTGGCAGGTCATGCTAAGGAACACC 
ACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAGGATACCTTGTGGAAA 
GTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTTACACTG 
AACTGAAACTCATGAAAAACCCAGACTGGAGACTGGAGGGTTACACTTGTGATTTATTAGTC 
AGGCCCTTCAAAGATGATATGTGGAGGAATTAAATATAAAGGAATTGGAGGTTTTTGCTAAA 
GAAATTAATAGGACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGG 
TGTTACTGAGTTATAAGCTCACTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTG 
AACAATGTAGTCACTTGAAGGTTTTGTGTATATCTTATGTGGATTACCAATTTAAAAATATA 
TGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATACTGAACAAAATTTTACCTGTTTT 
TGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATTATTATTTAAAATTA 
CTTCAACTTTGTGTTTTTAAATGTTTTGACGATTTCAATACAAGATAAAAAGGATAGTGAAT 
CATTCTTTACATGCAAACATTTTCCAGTTACTTAACTGATCAGTTTATTATTGATACATCAC 
TCCATTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAAT 
ATTTTACTGTGGTAATATAGAGAAGAATTAAAGCAAGAAAATCTGAAAA 
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FIGURE 137 

MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYNVIERVNWMYF 
FHFTLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEK 
EDKMIJU^SLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFIN 
TGNLVKYLLKLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRD 
LVPR I YEMMGHVKP I KFEDVYVG I CLNLLKVN I H I PEDTNL F FLYR I HLD VCQLRRV I AAHG 
FSSKEII TFWQVMLRNTTCHY 
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FIGIJRE 138 



CCTCTGTCCACTGCTTTCGTGAAGACAAGATGAAGTTCACAATTGTCTTTGCTGGACTTCTT 
GG AGT CTTT CTAGCTC CTG C C CTAG CTAACTATAATAT C AACGT CAATGATG AC AA CAACAA 
TGCTGGAAGTGGGCAGCAGTCAGTGAGTGTCAACAATGAACACAATGTGGCCAATGTTGACA 
ATAACAACGGATGGGACTCCTGGAATTCCATCTGGGATTATGGAAATGGCTTTGCTGCAACC 
AGACTCTTTCAAAAGAAGACATGCATTGTGCACAAAATGAACAAGGAAGTCATGCCCTCCAT 
TCAATCCCTTGATGCACTGGTCAAGGAAAAGAAGCTTCAGGGTAAGGGACCAGGAGGACCAC 
CTCCCAAGGGCCTGATGTACTCAGTCAACCCAAACAAAGTCGATGACCTGAGCAAGTTCGGA 
AAAAACATTGC7VAACATGTGTCGTGGGATTCCAACATACATGGCTGAGGAGATGCAAGAGGC 
AAGCCTGTTTTTTTACTCAGGAACGTGCTACACGACCAGTGTACTATGGATTGTGGACATTT 
CCTTCTGTGGAGACACGGTGGAGAACTAAACAATTTTTTAAAGCCACTATGGATTTAGTCAT 
CTGAATATGCTGTGCAGAAAAAATATGGGCTCCAGTGGTTTTTACCATGTCATTCTGAAATT 
TTT CT CT ACTAGTTATGTTT GATTT CTTT AAGTTT C AATAAAAT CATTTAG C ATTGAAAAAAA 
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FIGURE 139 

MKFTIVFAGLLGVFLAPALANYNIITVN^ 

IWDYGNGFAATRLFQKKTCIVHKMNKEVMPSIQSLDALVKEKKLQGKGPGGPPPKGLMYSVN 
PNr^DLSKFGKNIANMCRGIPTYMAEEMQEASLFFYSGTCYTTSVLWIVDISFCGDTVEN 
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FIGURE 140 

CATTTCTGAAACTAATCGTGTCAGAATTGACTTTGAAAAGCATTGCTTTTTACAGAAGTATA 
TTAACTTTTTAGGAGTAATTTCTAGTTTGGATTGTAATATGAAATAATTTAAAAGGGCTTCG 
CTCATATATAGGAAAATCGCATATGGTCCTAGTATTAAATTCTTATTGCTTACTGATTTTTT 
TGAGTTAAGAGTTGTTATATGCTAGAATATGAGGATGTGAATATAAATAAGAGAAGAAAAAA 
GAATAAAGTAGATTGAGTCTCCAATTTTATGTAAGCTTCAGAAGAACTGGTTTGTTTACATG 
CAAGCTTATAGTTGAAATATTTTTCAGGAATTACATGAATGACAGTCTTCGAACCAATGTGT 
TTGTTCGATTTCAACCAGAGACTATAGCATGTGCTTGCATCTACCTTGCAGCTAGAGCACTT 
CAGATTCCGTTGCCAACTCGTCCCCATTGGTTTCTTCTTTTTGGTACTACAGAAGAGGAAAT 
CCAGGAAATCTGCATAGAAACACTTAGGCTTTATACCAGAAAAAAGCCAAACTATGAATTAC 
TG GAAAAAGAAG T A GAAAAAAG AAAAGT AG C CTT ACAAGAAG CCAAAT TAAAAG CAAAGGG A 
TTGAATCCGGATGGAACTCCAGCCCTTTCAACCCTGGGTGGATTTTCTCCAGCCTCCAAGCC 
AT CATC AC CAAGAGAAG TAAAAG CTGAAGAGAAATC AC CAATCT CCATTAATGTGAAG ACAG 
TCAAAAAAGAACCTGAGGATAGACAACAGGCTTCCAAAAGCCCTTACAATGGTGTAAGAAAA 
G A CAGCAAGAGAAGTAGAAATAG C AGAAG T G C AAGT CG AT CG AG GT CAAGAACA C GAT CACG 
TT CTAGAT CACATACTC C AAG AAGACACTATAATAATAGGCGGAGT CGAT CTGGAACATACA 
G CTC GAGAT CAAGAAGC AGGTC C CG C AGT C ACAGTG AAAGC CCT CG AAGACATCATAAT CAT 
GGTTCTCCTCACCTTAAGGCCAAGCATACCAGAGATGATTTAAAAAGTTCAAACAGACATGG 
TCATAAAAGGAAAAAATCTCGTTCTCGATCTCAGAGCAAGTCTCGGGATCACTCAGATGCAG 
CCAAGAAACACAGGCATGAAAGGGGACATCATAGGGACAGGCGTGAACGATCTCGCTCCTTT 
GAGAGGTCCCATAAAAGCAAGCACCATGGTGGCAGTCGCTCAGGACATGGCAGGCACAGGCG 
CTGA CTTTCTCTTCCTTTGAGCCTGCATCAGTTCTTGGTTTTGCCTATCTACAGTGTGATGT 
ATGGACTCAATCAAAAACATTAAACGCAAACTGATTAGGATTTGATTTCTTGAAACCCTCTA 
GGTCTCTAGAACACTGAGGACAGTTTCTTTTGAAAAGAACTATGTTAATTTTTTTGCACATT 
AAAATGCCCTAGCAGTATCTAATTAAAAACCATGGTCAGGTTCAATTGTACTTTATTATAGT 
TGTGTATTGTTTATTGCTATAAGAACTGGAGCGTGAATTCTGTAAAAATGTATCTTATTTTT 
ATACAGATAAAATTGCAGACACTGTTCTATTTAAGTGGTTATTTGTTTAAATGATGGTGAAT 
ACTTTCTTAACACTGGTTTGTCTGCATGTGTAAAGATTTTTACAAGGAAATAAAATACAAAT 
CTTGTTTTTTCTAAAAAAAAAAAAAAAAAAGT 
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FIGURE 141 

MNDSLRTNVFVRFQPETIACACIY^ 

TRKKPNYELLEKEVEKRKVALQEAKXKAKGLNPDGTPALSTLGGFSPASKPSSPREVKAEEK 
SPISINVKTVKKEPEDRQQASKSPYNGVRKDSKRSRNSRSASRSRSRTRSRSRSHTPRRHYN 
NRRSRSGTYSSRSRSRSRSHSESPRRHHNHGSPHLKAKHTRDDLKSSNRHGHKRKKSRSRSQ 
SKSRDHSDAAKKHRHERGHHRDRRERSRSFERSHKSKHHGGSRSGHGRHRR 
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FIGURE 142 

TGGGGATAAAGGAAAAATGGTCAGGTATTAATGGCTTAAAGATTATTGGAAGGGGTTTATCA 
TTTTTTGAANNTATTCGGGTCANAATTGNCTTTGAAAAGCATTGCTTTTTACAGAAATATAT 
TANCTTTTTAGAGTAATTTCTAGTTTGGATTGTAATATGAAATTATTTAAAAGGGCTTCGCT 
CATATATAGGAAAATCGCATATGGTCCTAGTATTAAATTNTTATTGCTTACTGATTTTTTTG 
AGTTAAGAGTTGTTATATGNTAGAATATGAGGATGTGAATATAAATAAGAGAAGAAAAAAGA 
ATAAAGTAGATTGAGTCTCCAATTTTATGTAAGCTTCAGAAGAACTGGTTTGTTTACATGCA 
AG CTTATAGTTGAAAT ATTTTT CAGGAATTACATGAATGACAGTCTTCGAACCAATGTGTTT 
GTTCGATTTCAACCAGAGANTATAGCATGTGCTTGCATCTACCTTGCAGNTAGAGCACTTCA 
GATTCCGTTGCCAACTNGTCCCCATTGGTTTCTTCTTTTTGGTACTACAGAAGAGGAAATCC 
AGGAAATNTG CATAGAAACACTTAGGCTTTATACCAGAAAAAAG CCAAACTATGAATTACTG 
GAAAAAGAAGTAGAAAAAAGAAAAGTAGCCTTACAAGAAG CCNAATTAAAAG CAAAGGGATT 
GAATCCGGATGGAACTCCAGCCCTTTCAACCCTGGGTGGATTTTCTCC 
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FIGURE 143 

GGCACGAGGCCTCGTGCCAAGCTTGGCACGAGGGTGCACCGCGTTCTCGCACGCGTCATGGC 
GGTCCTCGGAGTACAGCTGGTGGTGACCCTGCTCACTGCCACCCTCATGCACAGGCTGGCGC 
CACACTGCTCCTTCGCGCGCTGGCTGCTCTGTAACGGCAGTTTGTTCCGATACAAGCACCCG 
TCTGAGGAGGAGCTTCGGGCCCTGGCGGGGAAGCCGAGGCCCAGAGGCAGGAAAGAGCGGTG 
GGCCAATGGCCTTAGTGAGGAGAAGCCACTGTCTGTGCCCCGAGATGCCCCGTTCCAGCTGG 
AGACCTGCCCCCTCACGACCGTGGATGCCCTGGTCCTGCGCTTCTTCCTGGAGTACCAGTGG 
TTTGTGGACTTTGCTGTGTACTCGGGCGGCGTGTACCTCTTCACAGAGGCCTACTACTACAT 
GCTGGGACCAGCCAAGGAGACTAACATTGCTGTGTTCTGGTGCCTGCTCACGGTGACCTTCT 
CCATCAAGATGTTCCTGACAGTGACACGGCTGTACTTCAGCGCCGAGGAGGGGGGTGAGCGC 
TCTGTCTGCCTCACCTTTGCCTTCCTCTTCCTGCTGCTGGCCATGCTGGTGCAAGTGGTGCG 
GGAGGAGACCCTCGAGCTGGGCCTGGAGCCTGGTCTGGCCAGCATGACCCAGAACTTAGAGC 
CACTTCTGAAGAAGCAGGGCTGGGACTGGGCGCTTCCTGTGGCCAAGCTGGCTATCCGCGTG 
GGACTGGCAGTGGTGGGCTCTGTGCTGGGTGCCTTCCTCACCTTCCCAGGCCTGCGGCTGGC 
CCAGACCCACCGGGACGCACTGACCATGTCGGAGGACAGACCCATGCTGCAGTTCCTCCTGC 
ACACCAGCTTCCTGTCTCCCCTGTTCATCCTGTGGCTCTGGACAAAGCCCATTGCACGGGAC 
TTCCTGCACCAGCCGCCGTTTGGGGAGACGCGTTTCTCCCTGCTGTCCGATTCTGCCTTCGA 
CTCTGGGCGCCTCTGGTTGCTGGTGGTGCTGTGCCTGCTGCGGCTGGCGGTGACCCGGCCCC 
ACCTGCAGGCCTACCTGTGCCTGGCCAAGGCCCGGGTGGAGCAGCTGCGAAGGGAGGCTGGC 
CGCATCGAAGCCCGTGAAATCCAGCAGAGGGTGGTCCGAGTCTACTGCTATGTGACCGTGGT 
GAGCTTGCAGTACCTGACGCCGCTCATCCTCACCCTCAACTGCACACTTCTGCTCAAGACGC 
TGGGAGGCTATTCCTGGGGCCTGGGCCCAGCTCCTCTACTATCCCCCGACCCATCCTCAGCC 
AGCGCTGCCCCCATCGGCTCTGGGGAGGACGAAGTCCAGCAGACTGCAGCGCGGATTGCCGG 
GGCCCTGGGTGGCCTGCTTACTCCCCTCTTCCTCCGTGGCGTCCTGGCCTACCTCATCTGGT 
GGACGGCTGCCTGCCAGCTGCTCGCCAGCCTTTTCGGCCTCTACTTCCACCAGCACTTGGCA 
GGCTCCTAGCTGCCTGCAGACCCTCCTGGGGCCCTGAGGTCTGTTCCTGGGGCAGCGGGACA 
CTAGCCTGCCCCCTCTGTTTGCGCCCCCGTGTCCCCAGCTGCAAGGTGGGGCCGGACTCCCC 
GGCGTTCCCTTCACCACAGTGCCTGACCCGCGGCCCCCCTTGGACGCCGAGTTTCTGCCTCA 
GAACTGTCTCTCCTGGGCCCAGCAGCATGAGGGTCCCGAGGCCATTGTCTCCGAAGCGTATG 
TGCCAGGTTTGAGTGGCGAGGGTGATGCTGGCTGCTCTTCTGAACAAATAAAGGAGCATGCC 
GATTTTTAA 
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FIGURE 144 



r^VLGVQLVVTLLTATLMKRLAPHCSFARWLLCNGSLFRYKHPSEEELRALAGKPRPRGRKE 
RWANGLSEEKPLSVPRDAPFQLETCPLTTVDALVLRFFLEYQWFVDFAVySGGVYLFTEAYY 
YMLGPAKETNI AVFWCLLTVTFS I KMFLTVTRLYFSAEEGGERSVCLTFAFLFLLLAMLVQV 
VREETLELGLEPGLASMTQNLEPLLKKQGWDWALPVAKLAIRVGLAWGSVLGAFLTFPGLR 
LAQTHRDALTMSEDRPMLQFLLHTSFLSPLFILWLWTKPIARDFLHQPPFGETRFSLLSDSA 
FDSGRLWLLVVLCLLRLAVTRPHLQAYLCLAKARVEQLRREAGRIEAREIQQRVVRVYCYVT 
WSLQYLTPL ILTLNCTLLLKTLGGYSWGLGPAPLLS PDPSSASAAP I GSGEDE VQQTAARI 
AGALGGLLTPLFLRGVLAYLIWWTAACQLLASLFGLYFHQHLAGS 
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FIGURE 145 

CGTTNGCACGCGTCAATGGCGGTCCTCGGAGTACAGCTGGTGGTGACCCTGCTCACTGCCAC 
CCTCATGCACAGGCTGGCGCCACACTGCTCCTTCGCGCGCTGGCTGCTCTGTAACGGCAGTT 
TGTTCCGATACAAGCACCCGTNTTGAGGAGGAGCTTCGGGCCCTGGCGGGGAAGCCGAGGCC 
CAGAGGCAGGAAAGAGCGGTGGGCCAATGGCCTTAGTGAGGAGAAGCCACTGTCTGTGCCCC 
GAGATGCCCCGTTCCAGCTGGAGACCTGCCCCCTCACGACCGTGGATGCCCTGGTCCTGCGC 
TTCTTCCTGGAGTACCAGTGGTTTGTGGACTTTGCTGTGTACTCGGGCGGCGTGTACCTCTT 
CACAGAGGCCTACTACTACATGCTGGGACCAGCCAAGGAGACTAACATTGCTGTGTTCTGGT 
GCCTGCTCACAGTGACCTTCTCCATCAAGATGTTCCTGACAGTGACACGGCTGTACTTCAGC 
GCCGAGGAGGGGGGTGAGCGCTCTGTCTGCCTCACCTTTGCCTTCCTCTTCCTGCTGCTGGC 
CATGCTGGTGCAAGCG 
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FIGURE 146 

GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTA 
TTAACGTGGCTTAATCTGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGC 
ATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGGTTTGGGCCCTTGTAGCTGACAGAAGGTGGC 
CAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGCTGGTCTTGCCTTGG 
CTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTG 
TAAAGGTGCCTCCCACTACGGCCTGACCAAAGATAGGAAGAGGCGCTCACAAGATGGCTGTC 
CAGACGGCTGTGCGAGCCTCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACC 
ATCTCCTTAATGACAGACGAGCCTGGCCTAGACAACCCTGCCTACGTGTCCTCGGCAGAGGA 
CGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTAGGGCACGGCCCT 
TTGAGAG AT C CACTATTAGAAG CAGATCATTTAAAAAAATAAATCGAGCTTTGAGTGTTCTT 
CGAAGGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGCAGGGAAAATTCTGA 
AAACACCACTGCCCCTGAAGTCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAAATTA 
CCAGCATCAAGATCAATCGAGTAGATCCCAGTGAAAGCCTCTCTATTAGGCTGGTGGGAGGT 
AGCGAAACCCCACTGGTCCATATCATTATCCAACACATTTATCGTGATGGGGTGATCGCCAG 
AGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGTCAACGGGATGGACATCAGCAATG 
TCCCTCACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTG 
ATGCGTGAACAGAAGTTCCGCAGCAGGAACAATGGACAGGCCCCGGATGCCTACAGACCCCG 
AGATGACAGCTTTCATGTGATTCTCAACAAAAGTAGCCCCGAGGAGCAGCTTGGAATAAAAC 
TGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCGGTGTGGCA 
TATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCAATGGACATGATCTTCG 
ATATGGCAGCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCG 
TCGTGTCCCGCCAGGTTCGGCAGCGGAGCCCTGACATCTTTCAGGAAGCCGGCTGGAACAGC 
AATGGCAGCTGGTCCCCAGGGCCAGGGGAGAGGAGCAACACTCCCAAGCCCCTCCATCCTAC 
AATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCGGCATGA 
CCGTCGCAGGGGGAGCATCACATAGAGAATGGGATTTGCCTATCTATGTCATCAGTGTTGAG 
CCCGGAGGAGTCATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTTGTTGAATGTGGA 
TGGGGTCGAACTGACAGAGGTCAGCCGGAGTGAGGCAGTGGCATTATTGAAAAGAACATCAT 
CCTCGATAGTACTCAAAGCTTTGGAAGTCAAAGAGTATGAGCCCCAGGAAGACTGCAGCAGC 
CCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCTGGGT 
CATGTGGCTGGAATTACCACGGTGCTTGTATAACTGTAAAGATATTGTATTACGAAGAAACA 
CAGCTGGAAGTCTGGGCTTCTGCATTGTAGGAGGTTATGAAGAATACAATGGAAACAAACCT 
TTTTTCATCAAATCCATTGTTGAAGGAACACCAGCATACAATGATGGAAGAATTAGATGTGG 
TGATATTCTTCTTGCTGTCAATGGTAGAAGTACATCAGGAATGATACATGCTTGCTTGGCAA 
GACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTTTT 
TTATAGAATCAATGATGGGTCAGAGGAAAACAGAAAAATCACAAATAGGCTAAGAAGTTGAA 
ACACTATATTTATCTTGTCAGTTTTTATATTTAAAGAAAGAATACATTGTAAAAATGTCAGG 
AAAAGTATGATCAT CTAATGAAAGC C AGTT ACAC CT CAGAAAAT ATGATT CC AAAAAAATTA 
AAACTACTAGTTTTTTTTCAGTGTGGAGGATTTCTCATTACTCTACAACATTGTTTATATTT 
TTTCTATTCAATAAAAAGCCCTAAAACAACTAAAATGATTGATTTGTATACCCCACTGAATT 
CAAGCTGATTTAAATTTAAAATTTGGTATATGCTGAAGTCTGCCAAGGGTACATTATGGCCA 
TTTTTAATTTACAGCTAAAATATTTTTTAAAATGCATTGCTGAGAAACGTTGCTTTCATCAA 
ACAAGAATAAATATTTTTCAGAAGTTAAA 
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FIGURE 147 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTAT 
APSPEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRS 
FKKINRALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDP 
SESLSIRLVGGSETPLVHI I IQHIYRDGVIARDGRLLPGDI ILKVNGMDISNVPHNYAVRLL 
RQP CQVLWLTVMREQKFR S RNNGQAPDA YR PRDD S FHVI LNKS S PEEQLG I KLVRKVDE PG V 
F I FNVL DGG VAYRHGQLE ENDR VLAI NGHDLRYG S P ES AAHL I QASERR VHL WSRQVRQRS 
PDIFQEAGWNSNGSWSPGPGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHRE 
WDLPIYVISVEPGGVISRDGRIKTGDILLNVDGVELTEVSRSEAV/OjLKRTSSSIVLKALEV 
KEYEPQEDCSSPAALDSNHNMAPPSDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIV 
GGYEEYNGNKPFFIKSIVEGTPAYNDGRIRCGDILLAVNGRSTSGMIHACLARLLKELKGRI 
TLTIVSWPGTFL 
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FIGURE 148 

CCA?^AGTGATCATTTGAAAAAGAGATATCCACATCTTCAAGCCCATATAAAGGATAGAAGCT 
GCACAGGGCAGCTTTACTTACTCCAGCACCTTCCTCTCCCAGGCAAATGGTGCTGACCATCT 
TTGGGATACAATCTCATGGATACGAGGTTTTTAACATCATCAGCCCAAGCAACAATGGTGGC 
AATGTTCAGGAGACAGTGACAATTGATAATGAAAAAAATACCGCCATCGTTAACATCCATGC 
AGGATCATGCTCTTCTACCACAATTTTTGACTATAAACATGGCTACATTGCATCCAGGGTGC 
TCTCCCGAAGAGCCTGCTTTATCCTGAAGATGGACCATCAGAACATCCCTCCTCTGAACAAT 
CTCCAATGGTACATCTATGAGAAACAGGCTCTGGACAACATGTTCTCCMCAAATACACCTG 
GGTCAAGTACAACCCTCTGGAGTCTCTGATCAAAGACGTGGATTGGTTCCTGCTTGGGTCAC 
CCATTGAGAAACTCTGCAAACATATCCCTTTGTATAAGGGGGAAGTGGTTGAAAACACACAT 
AATGTCGGTGCTGGAGGCTGTGCAAAGGCTGGGCTCCTGGGCATCTTGGGAATTTCAATCTG 
TGCAGACATTCATGTTTAGGATGATTAGCCCTCTTGTTTTATCTTTTCAAAGAAATACATCC 
TTGGTTTACACTCAAAAGTCAAATTAAATTCTTTCCCAATGCCCCAACTAATTTTGAGATTC 
AGTCAGAAAATATAAATG CTGTATTTATA 
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FIGURE 149 

MK I L. VAFLWLT I FG I QS HGYEVFN USPS NNGGNVQETVT I DNEKNTAI VN IHAGSC S STT 
I FDYKHGY I ASRVLSRRACF I L.KMDHQN I P PLNNLQWY I YEKQ ALDNMFSNKYT WVKYNP LE 
SLIKDVDWFLLGSPIEKLCKHIPLYKGEVVENTHNVGAGGCAKAGLLGILGISICADIHV 
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FIGURE 1 50 



GGCACGAGCCAGGAACTAGGAGGTTCTCACTGCCCGAGCAGAGGCCCTACACCCACCGAGGC 

ATGGGGCTCCCTGGGCTGTTCTGCTTGGCCGTGCTGGCTGCCAGCAGCTTCTCCAAGGCACG 

GGAGGAAGAAATTACCCCTGTGGTCTCCATTGCCTACAAAGTCCTGGAAGTTTTCCCCAAAG 

GCCGCTGGGTGCTCATAACCTGCTGTGCACCCCAGCCACCACCGCCCATCACCTATTCCCTC 

TGTGGAACCAAGAACATCAAGGTGGCCAAGAAGGTGGTGAAGACCCACGAGCCGGCCTCCTT 

CAACCTCAACGTCACACTCAAGTCCAGTCCAGACCTGCTCACCTACTTCTGCCGGGCGTCCT 

CCACCTCAGGTGCCCATGTGGACAGTGCCAGGCTACAGATGCACTGGGAGCTGTGGTCCAAG 

CCAGTGTCTGAGCTGCGGGCCAACTTCACTCTGCAGGACAGAGGGGCAGGCCCCAGGGTGGA 

GATGATCTGCCAGGCGTCCTCGGGCAGCCCACCTATCACCAACAGCCTGATCGGGAAGGATG 

GG CAGGTCCACCTGCAGCAGAGACCATGCCACAGGCAGCCTGCCAACTTCTCCTTCCTGCCG 

AGCCAGACATCGGACTGGTTCTGGTGCCAGGCTGCAAACAACGCCAATGTCCAGCACAGCGC 

CCTCACAGTGGTGCCCCCAGGTGGTGACCAGAAGATGGAGGACTGGCAGGGTCCCCTGGAGA 

GCCCCATCCTTGCCTTGCCGCTCTACAGGAGCACCCGCCGTCTGAGTGAAGAGGAGTTTGGG 

GGGTTCAGGATAGGGAATGGGGAGGTCAGAGGACGCAAAGCAGCAGCCATGTAGAATGAACC 

GTCCAGAGAGCCAAGCACGGCAGAGGACTGCAGGCCATCAGCGTGCACTGTTCGTATTTGGA 

GTTCATGCAAAATGAGTGTGTTTTAGCTGCTCTTGCCACAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 151 

MGLPGLFCLAVLAASSFSKAREEEITPWSIAYKVLEVFPKGRVA/LITCCAPQPPPPITYSL 
CGTKNIKVAKKVVKTHEPASFNLNVTLKSSPDLLTYFCRASSTSGAHVDSARLQMHWELWSK 
PVSELRANFTLQDRGAGPRVEMICQASSGSPPITNSLIGKDGQVHLQQRPCHRQPANFSFLP 
SQTSDWFWCQAANNANVQHSALTWP PGGDQKMEDWQGPLES P I LALPLYRSTRRLSEEEFG 
GFRIGNGEVRGRKAAAM 
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FIGURE 152 

GGTCCTTAATGGCAGCAGCCGCCGCTACCAAGATCCTTCTGTGCCTCCCGCTTCTGCTCCTG 

CTGTCCGGCTGGTCCCGGGCTGGGCGAGCCGACCCTCACTCTCTTTGCTATGACATCACCGT 

CATCCCTAAGTTCAGACCTGGACCACGGTGGTGTGCGGTTCAAGGCCAGGTGGATGAAAAGA 

CTTTTCTTCACTATGACTGTGGCAACAAGACAGTCACACCTGTCAGTCCCCTGGGGAAGAAA 

CTAAATGTCACAACGGCCTGGAAAGCACAGAACCCAGTACTGAGAGAGGTGGTGGACATACT 

TACAGAGCAACTGCGTGACATTCAGCTGGAGAATTACACACCCAAGGAACCCCTCACCCTGC 

AGGCAAGGATGTCTTGTGAGCAGAAAGCTGAAGGACACAGCAGTGGATCTTGGCAGTTCAGT 

TTCGATGGGCAGATCTTCC-TCCTCTTTGACTCAGAGAAGAGAATGTGGACAACGGTTCATCC 

TGGAGCCAGAAAGATGAAAGAAAAGTGGGAGAATGACAAGGTTGTGGCCATGTCCTTCCATT 

ACTTCTCAATGGGAGACTGTATAGGATGGCTTGAGGACTTCTTGATGGGCATGGACAGCACC 

CTGGAGCCAAGTGCAGGAGCACCACTCGCCATGTCCTCAGGCACAACCCAACTCAGGGCCAC 

AGCCACCACCCTCATCCTTTGCTGCCTCCTCATCATCCTCCCCTGCTTCATCCTCCCTGGCA 

TCTGAGGAGAGTCCTTTAGAGTGACAGGTTAAAGCTGATACCAAAAGGCTCCTGTGAGCACG 

GTCTTGATCAAACTCGCCCTTCTGTCTGGCCAGCTGCCCACGACCTACGGTGTATGTCCAGT 

GGCCTCCAGCAGATCATGATGACATCATGGACCCAATAGCTCATTCACTGCCTTGATTCCTT 

TTGCCAACAATTTTACCAGCAGTTATACCTAACATATTATGCAATTTTCTCTTGGTGCTACC 

TGATGGAATTCCTGCACTTAAAGTTCTGGCTGACTAAACAAGATATATCATTTTCTTTCTTC 

TCTTTTTGTTTGGAAAATCAAGTACTTCTTTGAATGATGATCTCTTTCTTGCAAATGATATT 

GTCAGTAAAATAATCACGTTAGACTTCAGACCTCTGGGGATTCTTTCCGTGTCCTGAAAGAG 

AATTTTTAAATTATTTAATAAGAAAAAATTTATATTAATGATTGTTTCCTTTAGTAATTTAT 

TGTTCTGTACTGATATTTAAATAAAGAGTTCTATTTCCCAAAAAAAAAAAAAAAAAA 
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FIGURE 153 

MAAAAATKILLCLPLLLLLSGWSRAGRADPHSLCYDITVIPKFRPGPRWCAVQGQVDEKTFL 
HYDCGNKTVTPVSPLGKKI^TTTAWKAQNPVLREVVDILTEQLRDIQLENYTPKEPLTLQAR 
MSCEQKAEGHSSGSWQFSFDGQIFLLFDSEKi^MWTTVHPGARKMKEKWENDKVVAMSFHYFS 
MGDCIGWLEDFLMGMDSTLEPSAGAPLAMSSGTTQLRATATTLILCCLLIILPCFILPGI 
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FIGURE 154 

GGGAAAGCCATTTCGAAAACCCATCTATACAAACTATATATTTTCATTTCTGCTGCTAGCTG 
CCTTGGGCCTCACAATTTTCATTCTGTTTTCrrGACTTTCAAGTTATATACCGTGGAATGGAG 
TTGATCCCAACCATAACATCGTGGAGGGTTTT^TTTTGGTGGTAGCCCTCACCCAATTCTG 
GTGTGGCTTTCTTTGCAGAGGATTCCACCTTCAAAATCATGAACTCTGGCTGTTGATCAAAA 
GAGAATTTGGATTCTACTCTAAAAGTCAATATAGGACTTGGCAAAAGAAGCTAGCAGAAGAC 
TCAACCTGGCCTCCCATAAACAGGACAGATTATTCAGGTGATGGCAAAAATGGATTCTACAT 
CAACGGAGGCTATGAAAGCCATGAACAGATTCCAAAAAGAAAACTCAAATTGGGAGGCCAAC 
CCACAGAACAGCATTTCTGGGCCAGGCTGTAATCAGAATTGTCGTCGTACATGCTCAACAGC 
ATTGCTTTTTTCCCCAAAATTAACACATTGTGGAGAAGTGATGATACTCTCCCCTTACCTTT 
CCTCTCTC CATT CAAG CATTCAAAGT AT ATTTTCAATGAATTAAACCTTG CAGCAAGGG AC C 
TTAGATAGGCTTATTCTGACTGTATGCTTTACCAATGAGAGAAAAAAATGCATTTCCTGTAT 
CATCCTTTTCAATAAACTGTATTCATTTTGAAAAAAAAAAAAAAAAAAAAAAA 



WO 99/63088 , ^ PCTAJS99/122S2 



FIGURE 155 



MELIPTITSWRVLILWALTQFWCGFLCRGFHLQ^ELVTLLIKREFGFYSKSQYRTWQKKLA 
EDSTWPPINRTDYSGDGKNGFYINGGYESHEQIPKRKLKLGGQPTEQHFWARL 
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PCT/US99/12252 



GTTCTCCTTTCCGAGCCAAAATCCCAGGCGATGGTGAATTATGAACGTGCCACACC^TGAAG 
CTCTTGTGGCAGGTAACTGTGCACCACCACACCTGGAATGCCATCCTGCTCCCGTTCGTCTA 
CCTCACGGCGCAAGTGTGGATTCTGTGTGCAGCCATCGCTGCTGCCGCCTCAGCCGGGCCCC 
AGAACTGCCCCTCCGTTTGCTCGTGCAGTAACCAGTTCAGCAAGGTGGTGTGCACGCGCCGG 
GGCCTCTCCGAGGTCCCGCAGGGTATTCCCTCGAACACCCGGTACCTCAACCTCATGGAGAA 
CAACATCCAGATGATCCAGGCCGACACCTTCCGCCACCTCCACCACCTGGAGGTCCTGCAGT 
TGGGCAGGAACTCCATCCGGCAGATTGAGGTGGGGGCCTTCAACGGCCTGGCCAGCCTCAAC 
ACCCTGGAGCTGTTCGACAACTGGCTGACAGTCATCCCTAGCGGGGCCTTTGAATACCTGTC 
CAAGCTGCGGGAGCTCTGGCTTCGCAACAACCCCATCGAAAGCATCCCCTCTTACGCCTTCA 
ACCGGGTGCCCTCCCTCATGCGCCTGGACTTGGGGGAGCTCAAGAAGCTGGAGTATATCTCT 
GAGGGAGCTTTTGAGGGGCTGTTC^CCTCAAGTATCTGAACTTGGGCATGTGCAACATTAA 
AGACATGCCCAATCTCACCCCCCTGGTGGGGCTGGAGGAGCTGGAGATGTCAGGGAACCACT 
TCCCTGAGATCAGGCCTGGCTCCTTCCATGGCCTGAGCTCCCTCAAGAAGCTCTGGGTCATG 
AACTCACAGGTCAGCCTGATTGAGCGGAATGCTTTTGACGGGCTGGCTTCACTTGTGGAACT 
CAACTTGGCCCACAATAACCTCTCTTCTTTGCCCCATGACCTCTTTACCCCGCTGAGGTACC 
TGGTGGAGTTGCATCTACACCACAACCCTTGGAACTGTGATTGTGACATTCTGTGGCTAGCC 
TGGTGGCTTCGAGAGTATATACCCACCAATTCCACCTGCTGTGGCCGCTGTCATGCTCCCAT 
GCACATGCGAGGCCGCTACCTCGTGGAGGTGGACCAGGCCTCCTTCCAGTGCTCTGCCCCCT 
TCATCATGGACGCACCTCGAGACCTCAACATTTCTGAGGGTCGGATGGCAGAACTTAAGTGT 
CGGACTCCCCCTATGTCCTCCGTGAAGTGGTTGCTGCCCAATGGGACAGTGCTCAGCCACGC 
CTCCCGCCACCCAAGGATCTCTGTCCTCAACGACGGCACCTTGAACTTTTCCCACGTGCTGC 
TTTCAGACACTGGGGTGTACACATGCATGGTGACCAATGTTGCAGGCAACTCCAACGCCTCG 
GCCTACCTCAATGTGAGCACGGCTGAGCTTAACACCTCCAACTACAGCTTCTTCACCACAGT 
AACAGTGGAGACCACGGAGATCTCGCCTGAGGACACAACGCGAAAGTACAAGCCTGTTCCTA 
CCACGTCCACTGGTTACCAGCCGGCATATACCACCTCTACCACGGTGCTCATTCAGACTACC 
CGTGTGCCCAAGCAGGTGGCAGTACCCGCGACAGACACCACTGACAAGATGCAGACCAGCCT 
GGATGAAGTCATGAAGACCACCAAGATCATCATTGGCTGCTTTGTGGCAGTGACTCTGCTAG 
CTGCCGCCATGTTGATTGTCTTCTATAAACTTCGTAAGCGGCACCAGCAGCGGAGTACAGTC 
ACAGCCGCCCGGACTGTTGAGATAATCCAGGTGGACGAAGACATCCCAGCAGCAACATCCGC 
AGCAGCAACAGCAGCTCCGTCCGGTGTATCAGGTGAGGGGGCAGTAGTGCTGCCCACAATTC 
ATGACCATATTAACTACAACACCTACAAACCAGCACATGGGGCCCACTGGACAGAAAACAGC 
CTGGGGAACTCTCTGCACCCCACAGTCACCACTATCTCTGAACCTTATATAATTCAGACCCA 
TACCAAGGACAAGGTACAGGAAACTCAAATATGACTCCCCTCCCCCAAAAAACTTATAAAAT 
GCAATAGAATGCACACAAAGACAGCAACTTTTGTACAGAGTGGGGAGAGACTTTTTCTTGTA 
TATGCTTATATATTAAGTCTATGGGCTGGTTAAAAAAAACAGATTATATTAAAATTTAAAGA 
CAAAAAGTCAAAACA 
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FIGURE 157 

MKLLWQVTVHHHTWNAIIJjPFVYLTAQW 

RRGLSEVPQG I PSNTR YLNLMENN I QMI QADTFRHLHKLE VLQLGRNS I RQI EVGAFNGLAS 
LNTLELFDNWLTVI PSGAFE YLSKLRELWLRNNP I ES I PS YAFNRVPSLMRLDLGELKKLEY 
ISEGAFEGLFNLPCYIjNLGMCNIKDMPNLTPLVGLEELEMSGNHFPEIRPGSFHGLSSLKKLW 
VMN S QVS L I ERN AF DG LAS LVE LNLAHNNLS SLPHDLFTP LR YLVELHLHHN PWNCDCD I LW 
IAWVJLREY I PTNSTCCGRCHAPMHMRGRYLVEVDQASFQCSAPFIMDAPRDLNI SEGRMAEL 
KCRTPPMSSVKWLLPNGTVLSHASRHPRISVLNDGTI^ 

ASAYLNVSTAELNTSNYSFFTTVTVETTEISPEDTTRKYKPVPTTSTGYQPAYT 
TTRVPKQVAVPATDTTDKMQTSLDEVMKTTKI 1 1 G CFVAVTLLAAAML I VFYKLRKRHQQRS 
TVTAARTVEI IQVDEDI PAATSAAATAAPSGVSGEGAWLPTIHDHINYNTYKPAHGAHWTE 
NSLGNSLHPTVTTISEPYI IQTHTKDKVQETQI 
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FIGURE 158 

CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTT 

TTCTTCAGCTCCTTCTCATCTCGTCCTTGCCAAGAGAGTACACAGTCATTAATGAAGCCTGC 

CCTGGAGCAGAGTGGAATATCATGTGTCGGGAGTGCTGTGAATATGATCAGATTGAGTGCGT 

CTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCCTTGCTGCAGGAATGAGGAGAATG 

AGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGA 

AATGGCTCATGGGGGGGTACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTG 

CCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTGGCCAGGTTCTGCGAGCCCCAAAGG 

GTCAGATTTTGTTGGAAAGCTATCCCCTAAATGCTCACTGTGAATGGACCATTCATGCTAAA 

CCTGGGTTTGTCATCCAACTAAGATTTGTCATGTTGAGTCTGGAGTTTGACTACATGTGCCA 

GTATGACTATGTTGAGGTTCGTGATGGAGACAACCGCGATGGCCAGATCATCAAGCGTGTCT 

GTGGCAACGAGCGGCCAGCTCCTATCCAGAGCATAGGATCCTCACTCCACGTCCTCTTCCAC 

TCCGATGG CT CC AAGAATTTTG ACGGTTTCCATGCCATTTATGAGGAGATCACAGCATGCT C 

CTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTG 

CCTGCTTGGCAGGCTATACTGGGCAGCGCTGTGAAAATCTCCTTGAAGAAAGAAACTGCTCA 

GACCCTGGGGGCCCAGTCAATGGGTACCAGAAAATAACAGGGGGCCCTGGGCTTATCAACGG 

ACGCCATGCTAAAATTGGGACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGTTCTTAGTG 

GCAATGAGAAAAGAACTTGCCAGCAGAATGGAGAGTGGTCAGGGAAACAGCCCATCTGCATA 

AAAGCCTGCCGAGAACCAAAGATTTCAGACCTGGTGAGAAGGAGAGTTCTTCCGATGCAGGT 

TCAGTCAAGGGAGACACCATTACACCAGCTATACTCAGCGGCCTTCAGCAAGCAGAAACTGC 

AGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGGATACCAACAT 

CTGCATACCCAGCTCCAGTATGAGTGCATCTCACCCTTCTACCGCCGCCTGGGCAGCAGCAG 

GAGGACATGTCTGAGGACTGGGAAGTGGAGTGGGCGGGCACCATCCTGCATCCCTATCTGCG 

GGAAAATTGAGAACATCACTGCTCCAAAGACCCAAGGGTTGCGCTGGCCGTGGCAGGCAGCC 

ATCTACAGGAGGACCAGCGGGGTGCATGACGGCAGCCTACACAAGGGAGCGTGGTTCCTAGT 

CTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACC 

TGGGGAAGGTCACCATGATCAAGACAGCAGACCTGAAAGTTGTTTTGGGGAAATTCTACCGG 

GATGATGACCGGGATGAGAAGACCATCCAGAGCCTACAGATTTCTGCTATCATTCTGCATCC 

CAACTATGACCCCATCCTGCTTGATGCTGACATCGCCATCCTGAAGCTCCTAGACAAGGCCC 

GTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTC 

CAGGAGTCCCACATCACTGTGGCTGGCTGGAATGTCCTGGCAGACGTGAGGAGCCCTGGCTT 

CAAGAACGACACACTGCGCTCTGGGGTGGTCAGTGTGGTGGACTCGCTGCTGTGTGAGGAGC 

AGCATGAGGACCATGGCATCCCAGTGAGTGTCACTGATAACATGTTCTGTGCCAGCTGGGAA 

CCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCC 

GGGACGAGCATCTCCTGAGCCACGCTGGCATCTGATGGGACTGGTCAGCTGGAGCTATGATA 

AAACATGCAGCCACAGGCTCTCCACTGCCTTCACCAAGGTGCTGCCTTTTAAAGACTGGATT 

GAAAGAAATATGAAATGAACCATGCTCATGCACTCCTTGAGAAGTGTTTCTGTATATCCGTC 

TGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGG 

CTGTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGC 

TGGTAGGCTGATGCCGCGTCCACTACTAGGACAGCCAATTGGAAGATGCCAGGGCTTGCAAG 

AAGTAAGTTTCTTCAAAGAAGACCATATACAAAACCTCTCCACTCCACTGACCTGGTGGTCT 

TCCCCAACTTTCAGTTATACGAATGCCATCAGCTTGACCAGGGAAGATCTGGGCTTCATGAG 

GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGC 

TGGGATGTGGTGCATGCCTTTGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCC 

ATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCTGAACTACAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 159 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREVV 
GYTIPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGD 
CMRCGQVLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRD 
GDNRDGQ 1 1 KRVCGNERP AP I Q S I GS S LHVLFHS DGS KNFDG FHAI YEE I TAGS S S PCFHDG 
TCVLDKAGSYKCACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTV 
VSFFCNNSYVLSGNEKRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLH 
QLYSAAFSKQKLQSAPTKKPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGK 
WSGRAP S C I P I CGK I ENI TAPKTQGLRWPWQAAI YRRTSGVHDGSLHKGAWFLVCSGALVNE 
RTVWAAH CVTDLGKVTM I KTADLKWLGKFYRDDDRDE KT I QS LQ I S AI I LHPNYDPILLD 
ADIAILKLLDKARISTRVQPICLAASRDLSTSFQESHI 

WSWDSLLCEEQHEDHGIPVSVTDNMFCASWEPTAPSDICTAETGGIAAVSFPGRASPEPR 
WHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWIERNMK 
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FTGURE 160 

ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGA 
AGCTTTCTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATGGGC 
TTCAACCTGACTTTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCT 
GACAGTGGTTGGGTGGGCCACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAG 
CAAAGGAGTTCATGGCTAATTTCCATAAGACCCTCATTTTGGGGAAGGGAAAAACTCTGACT 
AATGAAGCATCCACGAAGAAGGTAGAACTTGACAACTGTCCTTCTGTGTCTCCTTACCTCAG 
AGGCCAGAGCAAGCT CATTTT CAAACCAGAT CT CACTTTGGAAGAGGTACAGGCAGAAAATC 
CCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAGC^^ 

GTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAACATCTGCATCCCTTCCT 
GCAGAGGCAGCZAGCTGGATTATGGCATCTACGTCATCCACC^GGCTGAAGGTAAAAAGTTTA 
ATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGGACTGC 
TTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGA 
GCATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACACkSTTACGTTACAGTGGAT 
ATTTTGGGGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAAC 
AACTACTGGGGATGGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAAT 
GAAAATTTCCCGGCCCCTGCCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAGACA 
AAGGCAAT G AGGTGAACG CAGAACGG ATGAAG CT CTTACACCAAGTGTCACGAGTCTGGAGA 
ACAGATGGGTTGAGTAGTTGTTCTTATAAATTAGTATCTGTGGAACACAATCCTTTATATAT 
CAACATCACAGTGGATTTCTGGTTTGGTGCATGACCCTGGATCTTTTGGTGATGTTTGGAAG 
AACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAAATAGTAGCACACATTAAGA 
ACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCTTAGCAGAGCTCCT 
GGTGATGTAGAGTATAAAACAGTTGTAACAAGACAGCTTTCTTAGTCATTTTGATCATGAGG 
GTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGATAAA 
ATGAACGCTATTTGAGGACTCTGGTTGAAGGAGATTTATTTAAATTTGAAGTAATATATTAT 
GGGATAAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGT 
CCAAGGTAGAAAGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTG 
AAGTGGTGGTGTCAGGTGAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGA 
CACAGTGAACTTGGGAATGAAGAGGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAG 
TAGCTGAGCTGGTTGCAGGTGCTGATAGCCTTCAGGGGAGGACCTGCCCAGGTATGCCTTCC 
AGTGATGCCCA.CCAGAGAATACATTCTCTATTAGTTTTTAAAGAGTTTTTGTAAAATGATTT 
TGTACAAGTAGGATATGAATTAGCAGTTTACAAGTTTACATATTAACTAATAATAAATATGT 
CTATCAAATACCT CTGTAGTAAAATGTGAAAAAGCAAAA 
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FIGURE 161 

MGFNLTFHLSYKFRLLIJL.LTLCLTWGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKT 

LTNEASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRV 

AILVPHRNREKHIJmiLEHLHPFljQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALK^EN 

WDCFIFHDVDLVPENDraLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNG 

FSNNYWGWGGEDDDLRI^VELQRMKISRPLPEVGKYTW^ 

VWRTDGLSSCSYKLVSVEHNPLYINITVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites: 

amino acids 4-7, 220-223 and 335-338 



Xylose isomerase proteins: 

amino acids 191-201 
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fi gure m 

CGTGGGCCGGGGTCGCGCAGCGGGCTGTGGGCGCGCCCGGAGGAGCGACCGCCGCAGTTCTC 
GAGCTCCAGCTGCATTCCCTCCGCGTCCGCCCCACGCTTCTCCCGCTCCGGGCCCCGCA^Tg 
GCCCAGGCAGTGTGGTCGCGCCTCGGCCGCATCCTCTGGCTTGCCTGCCTCCTGCCCTGGGC 
CCCGGCAGGGGTGGCCGCAGGCCTGTATGAACTCAATCTCACCACCGATAGCCCTGCCACCA 
CGGGAGCGGTGGTGACCATCTCGGCCAGCCTGGTGGCCAAGGACAACGGCAGCCTGGCCCTG 
CCCGCTGACGCCCACCTCTACCGCTTCCACTGGATCCACACCCCGCTGGTGCTTACTGGCAA 
GATGGAGAAGGGTCTCAGCTCCACCATCCGTGTGGTCGGCCACGTGCCCGGGGAATTCCCGG 
TCTCTGTCTGGGTCACTGCCGCTGACTGCTGGATGTGCCAGCCTGTGGCCAGGGGCTTTGTG 
GTCCTCCCCATCACAGAGTTCCTCGTGGGGGACCTTGTTGTCACCCAGAACACTTCCCTACC 
CTGGCCCAGCTCCTATCTCACTAAGACCGTCCTGAAAGTCTCCTTCCTCCTCCACGACCCGA 
GCAACTTCCTCAAGACCGCCTTGTTTCTCTACAGCTGGGACTTCGGGGACGGGACCCAGATG 
GTGACTGAAGACTCCGTGGTCTATTATAACTATTCCATCATCGGGACCTTCACCGTGAAGCT 
CAAAGTGGTGGCGGAGTGGGAAGAGGTGGAGCCGGATGCCACGAGGGCTGTGAAGCAGAAGA 
CCGGGGACTTCTCCGCCTCGCTGAAGCTGCAGGAAACCCTTCGAGGCATCCAAGTGTTGGGG 
C C CACCCTAATT CAGACCTT CCAAAAGATGACCGTGACCTTGAACTTCCTGGGGAGC CCTC C 
TCTGACTGTGTGCTGGCGTCTCAAGCCTGAGTGCCTCCCGCTGGAGGAAGGGGAGTGCCACC 
CTGTGTCCGTGGCCAGCACAGCGTACAACCTGACCCACACCTTCAGGGACCCTGGGGACTAC 
TG CTTCAG CATCCGGG CCGAGAAT AT CATC AG CAAGACACATCAGTACCACAAGATCCAGGT 
GTGGCCCTCCAGAATCCAGCCGGCTGTCTTTGCTTTCCCATGTGCTACACTTATCACTGTGA 
TGTTGGCCTTCATCATGTACATGACCCTGCGGAATGCCACTCAGCAAAAGGACATGGTGGAG 
AACCCGGAGCCACCCTCTGGGGTCAGGTGCTGCTGCCAGATGTGCTGTGGGCCTTTCTTGCT 
GGAGACTCCATCTGAGTACCTGGAAATTGTTCGTGAGAACCACGGGCTGCTCCCGCCCCTCT 
ATAAGTCTGTCAAAACTTACACCGTGTGAGCACTCCCCCTCCCCACCCCATCTCAGTGTTAA 
CTGACTGCTGACTTGGAGTTTCCAGCAGGGTGGTGTGCACCACTGACCAGGAGGGGTTCATT 
TGCGTGGGGCTGTTGGCCTGGATCATCCATCCATCTGTACAGTTCAGCCACTGCCACAAGCC 
CCTCCCTCTCTGTCACCCCTGACCCCAGCCATTCACCCATCTGTACAGTCCAGCCACTGACA 
TAAGCCCCACTCGGTTACCACCCCCTTGACCCCCTACCTTTGAAGAGGCTTCGTGCAGGACT 
TTGATGCTTGGGGTGTTCCGTGTTGACTCCTAGGTGGGCCTGGCTGCCCACTGCCCATTCCT 
CTCATATTGGCACATCTGCTGTCCATTGGGGGTTCTCAGTTTCCTCCCCCAGACAGCCCTAC 
CTGTGCCAGAGAGCTAGAAAGAAGGTCATAAAGGGTTAAAAATCCATAACTAAAGGTTGTAC 
ACATAGATGGGCACACTCACAGAGAGAAGTGTGCATGTACACACACCACACACACACACACA 
CACACACACACAGAAATATAAACACATGCGTCACATGGGCATTTCAGATGATCAGCTCTGTA 
TCTGGTTAAGTCGGTTGCTGGGATGCACCCTGCACTAGAGCTGAAAGGAAATTTGACCTCCA 
AGCAGCCCTGACAGGTTCTGGGCCCGGGCCCTCCCTTTGTGCTTTGTCTCTGCAGTTCTTGC 
GCCCTTTATAAGGCCATCCTAGTCCCTGCTGGCTGGCAGGGGCCTGGATGGGGGGCAGGACT 
AATACTGAGTGATTGCAGAGTGCTTTATAAATATCACCTTATTTTATCGAAACCCATCTGTG 
AAACTTTCACTGAGGAAAAGGCCTTGCAGCGGTAGAAGAGGTTGAGTCAAGGCCGGGCGCGG 
TGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACGAGATCAGGA 
GATCGAGACCACCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAAAATACAAAAAGTT 
AGCCGGGCGTGGTGGTGGGTGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATG 
GTGCGAACCCGGGAGGCGGAGCTTGCAGTGAGCCCAGATGGCGCCACTGCACTCCAGCCTGA 
GTGACAGAGCGAGACTCTGTCTCCA 
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FIGURE 163 

MAQAVWSRLGRI LWLACLLPWAPAGVAAGLYELNLTTDSPATTGAWT I SASLVAKDNGSLA 
LPADAHLYRFHWIHTPLVLTGKMEKGLSSTIRWGHVPGEFPVSVVAn'AADCWMCQPVARGF 
WLP I TEFLVGDLWTQNTS LP WP S S YLTKTVLKVS FLLHDP SNFLKTAL FLY S WDFGDGTQ 
MVTEDSVVYYOTSIIGTFTVTCLKWAEWEEVEPD^ 

GPTLIQTFQKMTVTI.NFLGSPPLTVCWRLKPECLPLEEGECHPVSVASTAYNLTHTFRDPGD 
YC FS I RAENI I S KTHQYHKI QVWP S R I QPAVFAFPCATL I TVMLAF IMYMTLRNATQQKDMV 
ENPEPPSGVRCCCQMCCGPFLLETPSEYLEIVRENHGLLPPLYKSVKTYTV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domain: 

amino acids 339-362 

N-glycosylation sites. 

amino acids 34-37, 58-61, 142-145, 197-200, 300-303 and 364-367 
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FIGURE 164 

GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCT 
GGGCCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCA 
CAACAGACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTG 
GATGCCCATGTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCT 
GCGGCTG CTGTCATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTG 
CCCCCGTCCCCTCCCTTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAA 
TGGCTGGTTCTTTTGTTTTCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 165 

MALSSQIWAACLLLIiLIJ^LTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTH 
FPICIFCCGCCHRSKCGMCCKT 



WO 99/63088 




FTGTFRE 166 

CTGTCAGGAAGGACCATCTGAAGGCTGCAATTTGTTCTTAGGGAGGCAGGTGCTGGCCTGGC 
CTGGATCTTCCACCATGTTCCTGTTGCTGCCTTTTGATAGCCTGATTGTCAACCTTCTGGGC 
ATCTCCCTGACTGTCCTCTTCACCCTCCTTCTCGTTTTCATCATAGTGCCAGCCATTTTTGG 
AGTCTCCTTTGGTATCCGCAAACTCTACATGAAAAGTCTGTTAAAAATCTTTGCGTGGGCTA 

CCTTGAGAATGGAGCG AGGAG C CAAGGAGAAGAAC C AC CAGCTTT ACAAGCCCT ACACCAAC 
GGAATCATTGCAAAGGATCCCACTTCACTAGAAGAAGAGATCAAAGAGATTCGTCGAAGTGG 
TAGTAGTAAGGCTCTGGACAACACTCCAGAGTTCGAGCTCTCTGACATTTTCTACTTTTGCC 
GGAAAGGAATGGAGACCATTATGGATGATGAGGTGACAAAGAGATTCTCAGCAGAAGAACTG 
GAGTCCTGGAACCTGCTGAGCAGAACCAATTATAACTTCCAGTACATCAGCCTTCGGCTCAC 
GGTCCTGTGGGGGTTAGGAGTGCTGATTCGGTACTGCTTTCTGCTGCCGCTCAGGATAGCAC 
TGGCTTTCACAGGGATTAGCCTTCTGGTGGTGGGCACAACTGTGGTGGGATACTTGCCT^AAT 
GGGAGGTTTAAGGAATTCATGAGTAAACATGTTCACTTAATGTGTTACCGGATCTGCGTGCG 
AGCGCTGACAGCCATCATCACCTACCATGACAGGGAAAACAGACCAAGAAATGGTGGCATCT 
GTGTGGCCAATCATACCTCACCGATCGATGTGATCATCTTGGCCAGCGATGGCTATTATGCC 
ATGGTGGGTCAAGTGCACGGGGGACTCATGGGTGTGATTCAGAGAGCCATGGTGAAGGCCTG 
CCCACACGTCTGGTTTGAGCGCTCGGAAGTGAAGGATCGCCACCTGGTGGCTAAGAGACTGA 
CTGAACATGTGCAAGATAAAAGCAAGCTGCCTATCCTCATCTTCCCAGAAGGAACCTGCATC 
AATAATACATCGGTGATGATGTTCAAAAAGGGAAGTTTTGAAATTGGAGCCACAGTTTACCC 
TGTTGCTATCAAGTATGACCCTCAATTTGGCGATGCCTTCTGGAACAGCAGCAAATACGGGA 
TGGTGACGTACCTGCTGCGAATGATGACCAGCTGGGCCATTGTCTGCAGCGTGTGGTACCTG 
CCTCCCATGACTAGAGAGGCAGATGAAGATGCTGTCCAGTTTGCGAATAGGGTGAAATCTGC 
CATTGCCAGGCAGGGAGGACTTGTGGACCTGCTGTGGGATGGGGGCCTGAAGAGGGAGAAGG 
TGAAGGACACGTTCAAGGAGGAGCAGCAGAAGCTGTACAGCAAGATGATCGTGGGGAACCAC 
AAGGACAGGAGCCGCTCCTSAGCCTGCCTCCAGCTGGCTGGGGCCACCGTGCGGGGTGCCAA 
CGGGCTCAGAGCTGGAGTTGCCGCCGCCGCCCCCACTGCTGTGTCCTTTCCAGACTCCAGGG 
CTCCCCGGGCTGCTCTGGATCCCAGGACTCCGGCTTTCGCCGAGCCGCAGCGGGATCCCTGT 
GCACCCGGCGCAGCCTACCCTTGGTGGTCTAAACGGATGCTGCTGGGTGTTGCGACCCAGGA 
CGAGATGCCTTGTTTCTTTTACAATAAGTCGTTGGAGGAATGCCATTAAAGTGAACTCCCCA 
CCTTTGCACGCTGTGCGGGCTGAGTGGTTGGGGAGATGTGGCCATGGTCTTGTGCTAGAGAT 
GGCGGTACAAGAGTCTGTTATGCAAGCCCGTGTGCCAGGGATGTGCTGGGGGCGGCCACCCG 
CTCTCCAGGAAAGGCACAGCTGAGGCACTGTGGCTGGCTTCGGCCTCAACATCGCCCCCAGC 
CTTGGAGCTCTGCAGACATGATAGGAAGGAAACTGTCATCTGCAGGGGCTTTCAGCAAAATG 
AAGGGTTAGATTTTTATGCTGCTGCTGATGGGGTTACTAAAGGGAGGGGAAGAGGCCAGGTG 
GGCCGCTGACTGGGCCATGGGGAGAACGTGTGTTCGTACTCCAGGCTAACCCTGAACTCCCC 
ATGTGATGCGCGCTTTGTTGAATGTGTGTCTCGGTTTCCCCATCTGTAATATGAGTCGGGGG 
GAATGGTGGTGATTCCTACCTCACAGGGCTGTTGTGGGGATTAAAGTGCTGCGGGTGAGTGA 
AGGACACATCACGTTCAGTGTTTCAAGTACAGGCCCACAAAACGGGGCACGGCAGGCCTGAG 
CTCAGAGCTGCTGCACTGGGCTTTGGATTTGTTCTTGTGAGTAAATAAAACTGGCT 
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FIGURE 167 

MFLLLPFDSLIVNLLGISLTVLFTLLLVFIIVPAIFGVSFGIRKLYMKSLLKIFAWATLRME 
RGAKEKNHQLYKPYTNGIIAKDPTSLEEEIKEIRRSGSSKALDNTPEFELSDIFYFCRKGME 
TIMDDEVTKRFSAEELESWNLLSRTNYNFQYISLF^ 

ISLLWGTTWGYLPNGRFKEFMSKHVHLMCYRI CVRALTAI I TYHDR ENRPRNGG I CVANH 
TSPIDVIILASDGYYAMVGQVHGGI^GVIQRAMVKACPHVWFERSEVKDRHLVAKRLTEHVQ 
DKSKLPILIFPEGTCINNTSVMMFKKGSFEIGATWPVAIKTDPQFGDAFWNSSKYGMVTYL 
IJlMMTSWAIVCSWYLPPMTREADEDAVQFAmVKSAIARQGGLVDLLWDGGLKREKVKDT^ 
KEEQQKLYS KMI VGNHKDRSRS 
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FIGURE 168 

GCCCCTCGAAACCAGGACTCCAGCACCTCTGGTCCCGCCCTCACCCGGACCCCTGGCCCTCA 

CGTCTCCTCCAGGGATGGCGCTGGCGGCTTTGATGATCGCCCTCGGCAGCCTCGGCCTCCAC 

ACCTGGCAGGCCCAGGCTGTTCCCACCATCCTGCCCCTGGGCCTGGCTCCAGACACCTTTGA 

CGAT ACCT ATGTGGGTTGTG C AG AGG AG ATGG AGG AG AAGG C AG C C CC C CTG CT AAAGG AGG 

AAATGGCCCACCATGCCCTGCTGCGGGAATCCTGGGAGGCAGCCCAGGAGACCTGGGAGGAC 

AAGCGTCGAGGGCTTACCTTGCCCCCTGGCTTCAAAGCCCAGAATGGAATAGCCATTATGGT 

CTACACCAACTCATCGAACACCTTGTACTGGGAGTTGAATCAGGCCGTGCGGACGGGCGGAG 

GCTCCCGGGAGCTCTACATGAGGCACTTTCCCTTCAAGGCCCTGCATTTCTACCTGATCCGG 

GCCCTGCAGCTGCTGCGAGGCAGTGGGGGCTGCAGCAGGGGACCTGGGGAGGTGGTGTTCCG 

AGGTGTGGGCAGCCTTCGCTTTGAACCCAAGAGGCTGGGGGACTCTGTCCGCTTGGGCCAGT 

TTGCCTCCAGCTCCCTGGATAAGGCAGTGGCCCACAGATTTGGGGAGAAGAGGCGGGGCTGT 

GTGTCTGCGCCAGGGGTGCAGCTAGGGTCACAATCTGAGGGGGCCTCCTCTCTGCCCCCCTG 

GAAGACTCTGCTCTTGGCCCCTGGAGAGTTCCAGCTCTCAGGGGTTGGGCCCTGAAAGTCCA 

ACATCTGCCACTTAGGAGCCCTGGGAACGGGTGACCTTCATATGACGAAGAGGCACCTCCAG 

CAGCCTTGAGAAGCAAGAACATGGTTCCGGACCCAGCCCTAGCAGCCTTCTCCCCAACCAGG 

ATGTTGGCCTGGGGAGGCCACAGCAGGGCTGAGGGAACTCTGCTATGTGATGGGGACTTCCT 

GGGACAAGCAAGGAAAGTACTGAGGCAGCCACTTGATTGAACGGTGTTGCAATGTGGAGACA 

TGGAGTTTTATTGAGGTAGCTACGTGATTAAATGGTATTGCAGTGTGGA 
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FIGURE 169 

MALAALMIALGSLGLHTWQAQAVPTILPLGL^ 

ALLRESWEAAQETWEDKRRGLTLPPGFKAQNGIAIMVYTNSSNTLYWELNQAVRTGGGSREL 
YMRHFPFKALHFYLIRALQLLRGSGGCSRGPGEWFRGVGSLRFEPKRLGDSVRLGQFASSS 
LDKAVAHRFGEKRRGCVSAPGVQLGSQSEGASSLPPWKTLLLAPGEFQLSGVGP 
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FIGURE 170 

GTGGCTTCATTTCAGTE^CTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCA 

CCCTCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTG 

GTCGGTTCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTC 

TATTGTCTGGACCTTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCA 

TAGTGACCCAAAATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAG 

CTCAGCAAACTGAAGAAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACT 

CCAGCAGCCCTCCACCCAGGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAG 

TCACCATGGGTCTGCAGAGCAATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATG 

GAACATGGGGAAGAGGATGTGATTTATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTC 

CCATAATGGGTCCATCCTCCCCATCTCCTGGAGATGGGGAGAAAGTGATATGACCTTCATCT 

GCGTTGCCAGGAACCCTGTCAGCAGAAACTTCTCAAGCCCCATCCTTGCCAGGAAGCTCTGT 

GAAGGTGCTGCTGATGACCCAGATTCCTCCATGGTCCTCCTGTGTCTCCTGTTGGTGCCCCT 

CCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTGGTTTCTGAAGAGAGAGAGACAAGAAG 

AGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGGAAACTCCTAACATATGCCCCCAT 

TCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAATAGAACAATCCTAAAGGAAGA 

TCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAAAGATGGAAAATCCCCACTCAC 

TGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTATC TAG ACAGCAGTG 

CACTCCCCTAAGTCTCTGCTCA 
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FIGURE 171 

MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVT 
IQPEGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHV 
YEHLSKPfCVTMGLQSNKNGTCTTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISTOW 
GESD^FIOTARNPVSRNFSSPILARKLCEGA^ 

FLKRERQEE Y IEEKKRVD I CRETPN I CPHSGENTEYDT I PHTNRT I LKEDPANTVYSTVE I P 
KKMENPHSLLTMPDTPRLFAYENVI 



WO 99/63088 . PCI7US99/12252 

FIGURE 172 

CTGGTTCCCCAACATGCCTCACCCTCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCC 

TCTGGACCCGTGAAAGAGCTGGTCGGTTCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTC 

CAAAGTAAAGCAAGTTGACTCTATTGTCTGGACCTTCAACACAACCCCTCTTGTCACCATAC 

AGCCAGAAGGGGGCACTATCATAGTGACCCAAAATCGTAATAGGGAGAGAGTAGACTTCCCA 

GATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAGAAGAATGACTCAGGGATCTACTATGT 

GGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCAGGAGTACGTGCTGCATGTCTACG 

AGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCAATAAGAATGGCACCTGTGTG 

ACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATTTATACCTGGAAGGCCCT 

GGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTCCTGGAGATGGGGAG 

AAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACTTCTCAAGCCCC 

ATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATGGTCCTCCT 

GTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTGGTTTC 

TGAAGAGAGAGAGACAAGAAGAGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGGAA 

ACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAA 

TAGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAA 

AGATGGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAG 

AATGTTATCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCAAAAAAAAAAAAAAAAAAA 
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FIGURE 173 

GAAAGACGTGGTCCTGACAGACAGACAATCCTATTCCCTACCAA AATG AAGATGCTGCTGCT 

GCTGTGTTTGGGACTGACCCTAGTCTGTGTCCATGCAGAAGAAGCTAGTTCTACGGGAAGGA 

ACTTTAATGTAGAAAAGATTAATGGGGAATGGCATACTATTATCCTGGCCTCTGACAAAAGA 

GAAAAGATAGAAGAACATGGCAACTTTAGACTTTTT CTGGAG CAAATCCATGTCTTGGAGAA 

TTCCTTAGTTCTTAAAGTCCATACTGTAAGAGATGAAGAGTGCTCCGAATTATCTATGGTTG 

CTGACAAAACAGAAAAGGCTGGTGAATATTCTGTGACGTATGATGGATTCAATACATTTACT 

ATACCTAAGACAGACTATGATAACTTTCTTATGGCTCACCTCATTAACGAAAAGGATGGGGA 

AACCTTCCAGCTGATGGGGCTCTATGGCCGAGAACCAGATTTGAGTTCAGACATCAAGGAAA 

GGTTTGCACAACTATGTGAGGAGCATGGAATCCTTAGAGAAAATATCATTGACCTATCCAAT 

GCCAATCGCTGCCTCCAGGCCCGAGAATGAAGAATGGCCTGAGCCTCCAGTGTTGAGTGGAC 

ACTTCTCACCAGGACTCCACCATCATCCCTTCCTATCCATACAGCATCCCCAGTATAAATTC 

TGTGATCTGCATTCCATCCTGTCTCACTGAGAAGTCCAATTCCAGTCTATCAACATGTTACC 

TAGGATACCTCATCAAGAATCAAAGACTTCTTTAAATTTCTCTTTGATACACCCTTGACAAT 

TTTTCATGAAATTATTCCTCTTCCTGTTCAATAAATGATTACCCTTGCACTTAA 



WO 99/63088 . PCT/US99/12252 

FIGURE 174 

MKMLLLLCLGLTLVCVHAEEASSTGRNFNVEKINGSWHTI ILASDKREKIEEHGNFRLFLEQ 
IHVLENSLVLK^HTVRDEECSELSWADKTEKAGEYSVTyDGFNTFTIPKTDYDNFLMAHLI 
NEKDGETFQLMGLYGRE PDLS S D I KERFAQL CEEHG I LREN 1 1 DLSNANRCLQARE 
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FIGURE 175 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAA 
TGGATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTG 
TCAGCTTAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTC 
CCAGG AAT TATAGG AG CAGGT C TGATGG C C ATTC CAG C AACAACAATGT C CTTG ACAG C AAG 
AAAAAGAGCGTGCTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCA 
CAGTCATTGGTGCTCTGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTC 
ATGTGTAATTCTCCAAGCAACAGTAATGCCAATTGTGAATTTTCATTGAAAAACATCAGTGA 
CATTCATCCAGAATCCTTCAACTTGCAGTGGTTTTTCT^TGACTCTTGTGCACCTCCTACTG 
GTTTCAATAAACCCACCAGTAACGACACCATGGCGAGTGGCTGGAGAGCATCTAGTTTCCAC 
TTCGATTCTGAAGAAAACAAACATAGGCTTATCCACTTCTCAGTATTTTTAGGTCTATTGCT 
TGTTGGAATTCTGGAGGTCCTGTTTGGGCTCAGTCAGATAGTCATCGGTTTCCTTGGCTGTC 
TGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTGTGTAGTTTAATGGGAATAAAATGTAAGTA 
TCAGTAGTTTGAAAAAAAAAAA 
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FIGURE 176 



MTCCEGWTSCNGFSLLVLLLLGVVLNAI PL I VSLVE EDQFSQNP I S CFEWWFPG 1 1 GAGLMA 
I PATTMSLTARKRACCNNRTGMFLSSFFS V I TVI GALY CML I S I QALLKGPLMCNS PSNSNA 
NCEFSLKNISDIHPESFNLQWFFNDSGAPPTGFNKPTSTOTMASGWRASSFHFDSEENKHRL 
IHFSVFLGLLLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 
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FIGURE 177 

GT CGAAT C C AAATCACTCATTG TGAAAG CTG AG CT C AC AG C C G AAT AAG C CAC CATGAGGC T 
GTCAGTGTGTCTCCTGATGGTCTCGCTGGCCCTTTGCTGCTACCAGGCCCATGCTCTTGTCT 
GCCCAGCTGTTGCTTCTGAGATCACAGTCTTCTTATTCTTAAGTGACGCTGCGGTAAACCTC 
CAAGTTGCCAAACTTAATCCACCTCCAGAAGCTCTTGCAGCCAAGTTGGAAGTGAAGCACTG 
CACCGATCAGATATCTTTTAAGAAACGACTCTCATTGAAAAAGTCCTGGTGGAAA2AGTGAA 
AAAATGTGGTGTGTGACATGTAAAAATGCTCAACCTGGTTTCCAAAGTCTTTCAACGACACC 
CTGATCTTCACTAAAAATTGTAAAGGTTTCAACACGTTGCTTTAATAAATCACTTGCCCTGC 
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FIGURE 178 



MRLS VCLLMVSI^CCTQAHALVCPAVASE I 
KHCTDQISFKKRLSLKKSWWK 
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FTGTTRE 179 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGG 
TGAAGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTG 
GAGCAGATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTC 
GAACTGTGACATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGG 
AAGCCAATGACCCATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAG 
CTGAGCGGACTGATCTGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGG 
CAAATG CAAATACAAGAG CAGC CAGAAG CAGC ACAG T C CT GT ACCTG AGAAGGCCATCCCAC 
TCATCACTCCAGGCTCTGCCACTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGA 
AGCCTAACACTGGCCCCCAGCACCTCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTC 
TCTCCAAGGGCAGGCTGTTAGGCCCCTTTCTGATCAGGAGGCTTCTTTATGAATTAAACTCG 
CCCCACCACCCCCTCA 
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FIGURE 1 80 



MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCK 
YKSSQKQHSPVPEKAIPLITPGSATTC 
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FIGURE 181 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCC 
TGGCTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTC 
CTGGCTACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCC 
AGTGTTTCCCACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCT 
ACAGAGGAGGGCTTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGT 
ATGGCTGGGTCCCATCATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCA 
CCAATGCCTCAGCTGCCATTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGG 
CTGGGAGAAGGGATACTGCTGAGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGAC 
GCCCGCCTTCCATTTCAACATCCTGAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACA 
TCATGCTTGACAAGTGGCAGCACCTGGCCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAG 
CACATCAG CCT CATGAC CTTGGACAGT CTACAGAAATG CATCTTCAG CTTTGACAG CCATTG 
TCAGGAGAGGCCCAGTGAATATATTGCCACCATCTTGGAGCTCAGTGCCCTTGTAGAGAAAA 
GAAGCCAGCATATCCTCCAGCACATGGACTTTCTGTATTACCTCTCCCATGACGGGCGGCGC 
TTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGACGCTGTCATCCGGGAGCGGCGTCG 
CACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAAAGCCAAGTCCAAGACTTTGG 
ATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGGCATTGTCAGATGAGGAT 
ATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACGGCCAGTGGCCTCTC 
CTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCGACAGGAGGTGC 
AAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCCAGCTGCCC 
TTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATCTCCCG 
ATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTGCC 
TCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGAC 
CCCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCn'GGCTTTTATTCCTTTCTC 
CGCAGGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGG 
CGTTGATGCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAA 
TTGATCATGCGCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCA 
GTGACTTTCTGACCCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 182 

msllslpv^glrpvamspwlllllwgswli^rilawtyafynncrrlqcfpqppkrnwfwg 
hlglitpteeglkdstqmsatysqgftw7lgp 1 1 pf i vlchpdtirs i tnasaai apkdnlf 
irflkpwlgegillsggdkwsrhrrmltpafhfnilksyitifnksanimldkwqhiasegs 
sriadmfehislmtldslqkcifsfdshcqerpseyiatilelsalvekrsqhilqhmdflyy 
lshdgrrfhracrlvhdftdavirerrrtlptqgiddffkdkaksktldfidvlllskdedg 
kalsdediraeadt™fgghdttasglswvlyniju^hpeyqercroevqellkdrdpkeiew 
ddlaqlpfltmcvkeslrlhppapfisrcctqdivlpdgrvipkgitclidiigvhhnptvw 
pdpevydpfrfdpenskgrsplafipfsagprncigqafamaemkvvi^mliihfrflpdht 

E PRRKLEL I MRAEGGLWLRVE PLNVGLQ 
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FIGURE 183 

CAACAGAAG C CAAGAAGGAAG CCGT CTATCTTGTGG CGAT CATGTATAAG CTGG CCTC CTG C 
TGTTTGCTTTTCACAGGATTCTTAAATCCTCTCTTATCTCTTCCTCTCCTTGACTCCAGGGA 
AATATCCTTTCAACTCTCAGCACCTCATGAAGACGCGCGCTTAACTCCGGAGGAGCTAGAAA 
GAGCTTCCCTTCTACAGATATTGCCAGAGATGCTGGGTGCAGAAAGAGGGGATATTCTCAGG 
AAAG C AG ACT CAAGTACC AACAT TTT TAAC C C AAGAGG AAAT TT G AG AAAGTTT CAGG ATTT 
CTCTGGACAAGATCCTAACATTTTACTGAGTCATCTTTTGGCCAGAATCTGGAAACCATACA 
AGAAACGTGAGACTCCTGATTGCTTCTGGAAATACTGTGTCTGAAGTGAAATAAGCATCTGT 
TAGTCAGCTCAGAAACACCCATCTTAGAATATGAAAAATAACACAATGCTTGATTTGAAAAC 
AGTGTGGAGAAAAACTAGGCAAACTACACCCTGTTCATTGTTACCTGGAAAATAAATCCTCT 
ATGTTTTG CACAAAAAAAAAAAAAAA 
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FIGURE 184 



MYKLASCCLLFTGFLNPLLSLPLLDSREISFQLSAPHEDARLTPEELERASLLQILPEMLGA 
ERGDILRKADSSTNIFNPRGNLRKFQDFSGQDPNILLSHLLARIWKPYKKRETPDCFWKYCV 
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FIGURE 185 

GAACATTTTTAGTTCCCAAGGAATGTACATCAGCCCCACGGAAGCTAGGCCACCTCTGGGAT 
GGGGTTGCTGGTTTAAAACAAACGCCAGTCATCCTATATAAGGACCTGACAGCCACCAGGCA 
CCACCTCCGCCAGGAACTGCAGGCCCACCTGTCTGCAACCCAGCTGAGGCCATGCCCTCCCC 
AGGGACCGTCTGCAGCCTCCTGCTCCTCGGCATGCTCTGGCTGGACTTGGCCATGGCAGGCT 
CCAGCTTCCTGAGCCCTGAACACCAGAGAGTCCAGCAGAGAAAGGAGTCGAAGAAGCCACCA 
GCCAAGCTGCAGCCCCGAGCTCTAGCAGGCTGGCTCCGCCCGGAAGATGGAGGTCAAGCAGA 
AGGGGCAGAGGATGAACTGGAAGTCCGGTTCAACGCCCCCTTTGATGTTGGAATCAAGCTGT 
CAGGGGTTCAGTACCAGCAGCACAGCCAGGCCCTGGGGAAGTTTCTTCAGGACATCCTCTGG 
GAAGAGGCCAAAGAGGCCCCAGCCGACAAGTGATCGCCCACAAGCCTTACTCACCTCTCTCT 
AAGTTTAGAAGCGCTCATCTGGCTTTTCGCTTGCTTCTGCAGCAACTCCCACGACTGTTGTA 
CAAG CT CAG G AGG CGAATAAAT G TT CAAACTGT A 
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FIGURE 186 

MPSPGTVCSLlLLLGMLWLDLlAJ^GS 

GQ AEGAEDELEVRFNAPFDVG IKLSGVQYQQHSQALGKFLQD I LWEEAKEAPADKO 
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FIGURK 1 87 

CGGCCACAGCTGGCATGCTCTGCCTGATCGCCATCCTGCTGTATGTCCTCGTCCAGTACCTC 

GTGAACCCCGGGGTGCTCCGCACGGACCCCAGATGTCAAGAATATGAACACGTGGCTGCTGT 

TCCTCCCCCTGTTCCCGGTGCAGGTGCAGACCCTGATAGTCGTGATCATCGGGATGCTCGTG 

CTCCTGCTGGACTTTCTTGGCTTGGTGCACCTGGGCCAGCTGCTCATCTTCCACATCTACCT 

GAGTATGTCCCCCACCCTAAGCCCCCGATCCCCCCAAGGCTGGGTGGTCAGAGCTGCTCATC 

TTACACCTCTACTTGAGTATGTCCCTAACCCTGAGCCCCCCACGCCTGGGGCCAGAGTCTTT 

GTCCCCCGTGTGCGCATGTGTTCAGGGTCAGCCTCTCCCAGAAGTGAGATCATGGACAAAAA 

GGGCAAATCACAGGAAGAAATTAAATCCATGAGGACCCAGCAGGCCCAGCAAGAAGCTGAAC 

TCACGCCGAGACCTGCAGGAGTGGTGCCAGGTGCTTGAAGTAACAAGTTTAAAATGTTCAGA 

GACAATGGAATGGAATCTATTAGGCAAGAACAGGACATTATGAAATAAGGACAGGTGGACTT 

C C AAAAAC AC AAG T AG AAAT T C T AAC AATGAAAT ATATT ACAGG CAGG TCACC C ACTAACCA 

AACAACTGAAGCGAGAGCTGTGGTCTTGCTTGGTCTCACAGTGGGCACAGCGGTAGGCGGTC 

AGTCATGTTGCTGAACGACGGAGGGTAAACTCCCCAGCCCCAAGAAAACCTGTGTTGGAAGT 

AACAACAACCTCCCTGCTCCTGGCACCAGCCGTTTTGGTCATGGTGGGCCAGCTGCAAAGCG 

TCTTCCATTCTCTGGGCAGTGGTGGCCCCGAGGCTGTGGCCTCTCAGGGGGTTTCTGTGGAC 

ACGGGCAGCAGAGTGTGTCCAGGCCAGCCCCCAAGAATGCCCTGCTCCTGACAGCTTGGCCA 

ACCCCTGGTCAGGGCAGAGGGAGTTGGGTGGGTCAGGCTCTGGGCTCACCTCCATCTCCAGA 

GCATCCCCTGCCTGCAGTTGTGGCAAGAACGCCCAGCTCAGAATGAACACACCCCACCAAGA 

GCCTCCTTGTTCATAACCACAGGTTACCCTACAAACCACTGTCCCCACACAACCCTGGGGAT 

GTTTTAAAACACACACCTCTAACGCATATCTTACAGTCACTGTTGTCTTGCCTGAGGGTTGA 

ATTTTTTTTAATGAAAGTGCAATGAAAATCACTGGATTAAATCCTACGGACACAGAGCTGAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 188 

MNTWLLFLPLFPVQVQTLIWIIGMLVLLLDFLGLVHLGQLLIFHIYLSMSPTLSPRSPQGW 
WRAAHLTPLLEYVPNPEPPTPGARVFVPRVRMCSGSASPRSEIMDKKGKSQEEIKSMRTQQ 
AQQEAELTPRPAGWPGA 
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FIGURE 189 

GGAGTGCAGATGGCATCCTTCGGTTCTTCCAGACAAGCTGCAAGACGCTGACCATGGCCAAG 

ATGGAGCTCTCGAAGGCCTTCTCTGGCCAGCGGACACTCCTATCTGCCATCCTCAGCATGCT 

ATCACTCAGCTTCTCCACAACATCCCTGCTCAGCAACTACTGGTTTGTGGGCACACAGAAGG 

TGCCCAAGCCCCTGTGCGAGAAAGGTCTGGCAGCCAAGTGCTTTGACATGCCAGTGTCCCTG 

GATGGAGATACCAACACATCCACCCAGGAGGTGGTACAATACAACTGGGAGACTGGGGATGA 

CCGGTTCTCCTTCCGGAGCTTCCGGAGTGGCATGTGGCTATCCTGTGAGGAAACTGTGGAAG 

AACCAGGGGAGAGGTGCCGAAGTTTCATTGAACTTACACCACCAGCCAAGAGAGGTGAGAAA 

GGACTACTGGAATTTGCCACGTTGCAAGGCCCATGTCACCCCACTCTCCGATTTGGAGGGAA 

GCGGTTGATGGAGAAGGCTTCCCTCCCCTCCCCTCCCTTGGGGCTTTGTGGCAAAAATCCTA 

TGGTTATCCCTGGGAACGCAGATCACCTACATCGGACTTCAATTCATCAGCTTCCTCCTGCT 

ACTAACAGACTTGCTACTCACTGGGAACCCTGCCTGTGGGCTCAAACTGAGCGCCTTTGCTG 

CTGTTTCCTCTGTCCTGTCAGGTCTCCTGGGGATGGTGGCCCACATGATGTATTCACAAGTC 

TTCCAAGCGACTGTCAACTTGGGTCCAGAAGACTGGAGACCACATGTTTGGAATTATGGCTG 

GGCCTTCTACATGGCCTGGCTCTCCTTCACCTGCTGCATGGCGTCGGCTGTCACCACCTTCA 

ACACGTACACCAGGATGGTGCTGGAGTTCAAGTGCAAGCATAGTAAGAGCTTCAAGGAAAAC 

CCGAACTGCCTACCACATCACCATCAGTGTTTCCCTCGGCGGCTGTCAAGTGCAGCCCCCAC 

CGTGGGTCCTTTGACCAGCTACCACCAGTATCATAATCAGCCCATCCACTCTGTCTCTGAGG 

GAGT CGACT TCT ACT C CG AG CT GC GG AAC AAG GG AT TT CAAAG AGG GG C C AG CCAGGAG CTG 

AAAGAAGCAGTTAGGTCATCTGTAGAGGAAGAGCAGTGTTAGGAGTTAAGCGGGTTTGGGGA 

GTAGGCTTGAGCCCTACCTTACACGTCTGCTGATTATCAACATGTGCTTAAGCCAACATCCG 

TCTCTTGAGCATGGTTTTTAGAGGCTACGAATAAGGCTATGAATAAGGGTTATCTTTAAGTC 

CTAAGGGATTCCTGGGTGCCACTGCTCTCTTTTCCTCTACAGCTCCATCTTGTTTCACCCAC 

CCCACATCTCACACATCCAGAATTCCCTTCTTTACTGATAGTTTCTGTGCCAGGTTCTGGGC 

TAAACCATGGAGATAAAAAGAAGAGTAAAATACACTTCCCGACCTTAAGGATCTGAAA 



WO 99/63088 . PCT/US99/12252 

FIGURE 190 

MAKMELSKAFSGQRTLLSAILSWLSLSFSTTSLLSNYWFVGTQKVPKPLCEKGLAAKCFDMP 
VSLDGDTNTSTQEWQYNWETGDDRFSFRSFRSGMWLSCEETVEEPGERCRSFIELTPPAKR 
GEKGLLEFATLQGPCHPTLRFGGKRLMEKASLPSPPLGLCGKNPMVIPGNADHLHRTSIHQL 
PPATNR1ATHWEPCLWAQTERLCCCFLCPVRSPGDGGPHDVFTSLPSDCQLGSRRLETTCLE 
LWLGLLHGLALLHLLHGVGCHHLQHVHQDGAGVQVQA 
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FIGURE 191 

AACTGGAAGGAAAGAAAGAAAGGTCAGCTTTGGCCCAGATGTGGTTACCCCTTGGTCTCCTG 

TCTTTATGTCTTTCTCCTCTTCCTATTCTGTCATCTCCCTCACTTAAGTCTCAGGCCTGTCA 

GCAGCTCCTGTGGACATTGCCATCCCCTCTGGTAGCCTTCAGAGCAAACAGGACAACCTATG 

TTATGGATGTTTCCACCAACCAGGGTAGTGGCATGGAGCACCGTAACCATCTGTGCTTCTGT 

GAT CT CT ATGACAG AG C CAC TT CT C C AC CTCTG AAATG TT CC CTG C TC TG AAAT CTG G CATG 

AGATGGCACAGGTGACCACGCAGAAGCCACCAGAATCTTGCCTGCCCTATTCCTCCTCCCAA 

GTCTGTTCTCTTATTGTCAACCTCAGCACAACAGGCTGGCGCCAATGGCATTACAGAGAAAG 

CAATCTGTGTGGCTAGTGGGCAGATTACCATGCAAGCCCCAGGAGAAATGGAGGAGCTTTGT 

AGCCACCTCCCTGTCT^GCCAGTATTAACATGTCCCCTTCCCCCTGCCCCGCCGTAGATTCAG 

GACATTCGCCCCTGTGTGCCACCAAACCAGGACTTTCCCCTTGGCTTGGCATCCCTGGCTCT 

CTCCTGGTACCCAGCAAGACGTCTGTTCCAGGGCAGTGTAGCATCTTTCAAGCTCCGTTACT 

ATGGCGATGGCCATGATGTTACAATCCCACTTGCCTGAATAATCAAGTGGGAAGGGGAAGCA 

GAGGGAAATGGGGCCATGTGAATGCAGCTGCTCTGTTCTCCCTACCCTGAGGAAAAACCAAA 

GGGAAGCAACAGGAACTTCTGCAACTGGTTTTTATCGGAAAGATCATCCTGCCTGCAGATGC 

TGTTGAAGGGGCACAAGAAATGTAGCTGGAGAAGATTGATGAAAGTGCAGGTGTGTAAGGAA 

ATAGAACAGTCTGCTGGGAGTCAGACCTGGAATTCTGATTCCAAACTCTTTATTACTTTGGG 

AAGTCACTCAGCCTCCCCGTAGCCATCTCCAGGGTGACGGAACCCAGTGTATTACCTGCTGG 

AACCAAGGAAACTAACAATGTAGGTTACTAGTGAATACCCCAATGGTTTCTCCAATTATGCC 

CATGCCACCAAAACAATAAAACAAAATTCTCTAACACTGAAA 
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FIGURE 192 



MWLPLGLLSLCLSPLPILSSPSLKSQACQQLLWTLPSPLVAFRANRTTYVMDVSTNQGSGME 
HRNHLCFCDLYDRATSPPLKCSLL 
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FIGURE 193 

GTAGCGCGTCTTGGGTCTCCCGGCTGCCGCTGCTGCCGCCGCCGCCTCGGGTCGTGGAGCCA 
GGAGCGACGTCACCGCCATGGCAGGCATCAAAGCTTTGATTAGTTTGTCCTTTGGAGGAGCA 
ATCGGACTGATGTTTTTGATGCTTGGATGTGCCCTTCCAATATACAACAAATACTGGCCCCT 
CTTTGTTCTATTTTTTTACATCCTTTCACCTATTCCATACTGCATAGCAAGAAGATTAGTGG 

ATGATACAGATG CTATGAGTAACG CTTGTAAGGAACTTG C CATCTTTCTT ACAACGGG CATT 
GTCGTGTCAGCTTTTGGACTCCCTATTGTATTTGCCAGAGCACATCTGATTGAGTGGGGAGC 
TTGTGCACTTGTTCTCACAGGAAACACAGTCATCTTTGCAACTATACTAGGCTTTTTCTTGG 
TCTTTGGAAGCAATGACGACTTCAGCTGGCAGCAGTGGTGAAAAGAAATTACTGAACTATTG 
TCAAATGGACTTCCTGTCATTTGTTGGCCATTCACGCACACAGGAGATGGGGCAGTTAATGC 
TGAATGGTATAGCAAGCCTCTTGGGGGTATTTTAGGTGCTCCCTTCTCACTTTTATTGTAAG 
CATACTATTTTCACAGAGACTTGCTGAAGGATTAAAAGGATTTTCTCTTTTGGAAAAGCTTG 
ACTGATTTC^CACTTATCTATAGTATGCTTTTTGTGGTGTCCTGCTGAATTTAAATATTTAT 
GTGTTTTTCCTGTTAGGTTGATTTTTTTTGGAATCAATATGCAATGTTAAACACTTTTTTAA 

TGTAAT CATTTG CATTGGTTAGGAATT CAGAATTC CGCCGG CTCTATTACTGGT CAAGTACA 
TCTTTTCTCTTAAAATTATTTAGCCTCCATTATTACAAAAAATTATAAAAATAAGTTTTCAG 
TCAGTCAGGATGACATCACTCCCAATGTTATGCAGACATACAG.ACGGTTGGCATACGTTATA 
GACTGTATACTCAGTGCAAATATAGCTGCATTTATACCTCAGAGGGGCCAAGTGTTAATGCC 
CATGCCCTCCGTTAAGGGTTGTTGGTTTTACTGGTAGACAGATGTTTTGTGGATTGAAAATT 
ATTTTATGGAATTGCTACAGAGGAGTGCTTTTCTTCTCAATTGTTAGAAGAATTTATGTTAA 
ACTTTAAGGTAAGGGTGTAAAAACATTTTTGAGATAAGGTTTTTATTTATGTTTATTATTGT 
TAGAGTGAGTTGCAATGTGGGAAGAAATGACATTGAAATTCCAGTTTTTGAATCCTGTTTCT 
ATTTATAAGTGAAATTTGTGATCTCCTATCAACCTTTCATGTTTTACCCTGTTAAAATGGAC 
ATACATGGAACCACTACTGATGAGGGACAGTTGTATGTTTGCATCATATATGCCAGAAAACC 
TTCCTCTGCTTCCTCCTTTTGACTTATTTGGTATGTTGTATATATTACATAAAATAACTTTT 
CAAATATAGTTTAATAACACTTAGAAGTGTTTACTTACCTGGAAAATAATTGCTATGCCGTA 
CATTCAGAGTGCCCCCTCCCCTGCAAGGCCTTGCCATGATTAACAAGTAACTTGTTAGTCTT 
ACAGATAATTCATGCATTAACAGTTTAAGATTTAGACCATGGTAATAGTAGTTCTTATTCTC 
TAAGGTTATATCATATGTAATTTAAAAGTATTTTTAAGACAAGTTTCCTGTATACCTCTGAA 
CTGTTTTGATTTTGAGTTCATCATGATAGATCTGCTGTTTCCTTATAAAAGGCATTTGTTGT 
GTGAGTTAATGCAAAGTAGCCAAGTCCAGCTATATAGCAGCTTCAGAAACATACCTGACCAA 
AAAATTC C CAGTAA CCAGGCATGATCAATTTATAGTGGTCGTTTACATCTAATAATTATCAG 
GACTTTTTTCAGGAGTGGGTTATAAAAACATTCAAGTTGGTCTGACAGTATTTTGTTAAGGA 
TATTTGTTTGTATGTTTATTCAGTATACTTACATAAAAATTATTTCGCCATCAGCCAAAACT 
CAGTAATCATGACAGCTGTCTGTTGTTTTATGAAGTTTATTTCTCAAGAAAATGGGAATAAA 
TTTGGGATTTGTTCAGCTTTTTTACTAAAGATGCCTAAAGCCACAGGTTTTATTGCCTAACT 
TAAG CCATGACTTTTAGATATGAGATGACGGG AAGCAGGACGAAATAT CGGCGTGTGG CTGG 
AGCCTTCCCACTGGAGGCTGAAAGTGGCTTGTGGTATTATAATGTTCAGATTTCAAGAGGAA 
GGTGCAGGTACACATGAGTTAGAGAGCTGGTGAGACAGTTGGGAACTCTTTGTGCTTGTGAT 
CTACTGGACTTTTTTTTTGCAGGAAGTGCATTCTCTGGTCCTTCCCTATTTTCTGTTCTGGA 
TGTCAGTGCAGTGCACTGCTACTGTTTTATCCACTTGGCCACAGACTTTTTCTAACAGCTGC 
GTATTATTTCTATATACTAATTGCATTGGCAGCATTGTGTCTTTGACCTTGTATACTAGCTT 
GACATAGTGCTGTCTCTGATTTCTAGGCTAGTTACTTGAGATATGAATTTTCCATAGAATAT 
GCACTGATACAACATTACCATTCTTCTATGGAAAGAAAACTTTTGATGATGAAACAATAAAG 
ATTTTAAATAT CTATTTTAAAAAAAAAA 
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FIGURE 194 

MAGIKALISLSFGGAIGLMFLMLGCALPIYNKYWPLFVLFFYILSPIPYCIARRLVDDTDAM 
SNACKELAI FLTTGI WS AFGLP I VF ARAHLI EWGACALVLTGNTVT FAT ILGFFLVFGSND 
DFSWQQW 
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FIGURE 195A 

CCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCC 

CACGCGTCCGCCCACGCGTCCGGTGCAAGCTCGCGCCGCACACTGCCTGGTGGAGGGAAGGA 

GCCCGGGCGCCTCTCGCCGCTCCCCGCGCCGCCGTCCGCACCTCCCCACCGCCCGCCGCCCG 

CCGCCCGCCGCCCGCAAAGCATGAGTGAGCCCGCTCTCTGCAGCTGCCCGGGGCGCGAATGG 

CAGGCTGTTTCCGCGGAGTAAAAGGTGGCGCCGGTCAGTGGTCGTTTCCAATGACGGACATT 

AACCAGACTGTCAGATCCTGGGGAGTCGCGAGCCCCGAGTTTGGAGTTTTTTCCCCCCACAA 

CGTCACAGTCCGAACTGCAGAGGGAAAGGAAGGCGGCAGGAAGGCGAAGCTCGGGCTCCGGC 

ACGTAGTTGGGAAACTTGCGGGTCCTAGAAGTCGCCTCCCCGCCTTGCCGGCCGCCCTTGCA 

GCCCCGAGCCGAGCAGCAAAGTGAGACATTGTGCGCCTGCCAGATCCGCCGGCCGCGGACCG 

GGGCTGCCT CGGAAACACAG AGGGGT CTT CT CTCG C C CT G C AT AT AATT AG CCTG CACACAA 

AGGG AG CAGCTGAATGGAGG TTGTCACT CT CTGGAAAAGGATTTCTGACCGAG CG CTTC CAA 

TGGACATTCTCCAGTCTCTCTGGAAAGATTCTCGCTAATGGATTTCCTGCTGCTCGGTCTCT 

GTCTATACTGGCTGCTGAGGAGGCCCTCGGGGGTGGTCTTGTGTCTGCTGGGGGCCTGCTTT 

CAGATGCTGCCCGCCGCCCCCAGCGGGTGCCCGCAGCTGTGCCGGTGCGAGGGGCGGCTGCT 

GTACTGCGAGGCGCTCAACCTCACCGAGGCGCCCCACAACCTGTCCGGCCTGCTGGGCTTGT 

CCCTGCGCTACAACAGCCTCTCGGAGCTGCGCGCCGGCCAGTTCACGGGGTTAATGCAGCTC 

ACGTGGCTCTATCTGGATCACAATCACATCTGCTCCGTGCAGGGGGACGCCTTTCAGAAACT 

GCGCCGAGTTAAGGAACTCACGCTGAGTTCCAACCAGATCACCCAACTGCCCAACACCACCT 

TCCGGCCCATGCCCAACCTGCGCAGCGTGGACCTCTCGTACAACAAGCTGCAGGCGCTCGCG 

CCCGACCTCTTCCACGGGCTGCGGAAGCTCACCACGCTGCATATGCGGGCCAACGCCATCCA 

GTTTGTGCCCGTGCGCATCTTCCAGGACTGCCGCAGCCTCAAGTTTCTCGACATCGGATACA 

ATCAGCTCAAGAGTCTGGCGCGCAACTCTTTCGCCGGCTTGTTTAAGCTCACCGAGCTGCAC 

CTCGAGCACAACGACTTGGTCAAGGTGAACTTCGCCCACTTCCCGCGCCTCATCTCCGTGCA 

CTCGCTCTGCCTGCGGAGGAACAAGGTGGCCATTGTGGTCAGCTCGCTGGACTGGGTTTGGA 

ACCTGGAGAAAATGGACTTGTCGGGCAACGAGATCGAGTACATGGAGCCCCATGTGTTCGAG 

ACCGTGCCGCACCTGCAGTCCCTGCAGCTGGACTCCAACCGCCTCACCTACATCGAGCCCCG 

GATCCTCAACTCTTGGAAGTCCCTGACAAGCATCACCCTGGCCGGGAACCTGTGGGATTGCG 

GGCGCAACGTGTGTGCCCTAGCCTCGTGGCTCAGCAACTTCCAGGGGCGCTACGATGGCAAC 

TTGCAGTGCGCCAGCCCGGAGTACGCACAGGGCGAGGACGTCCTGGACGCCGTGTACGCCTT 

CCACCTGTGCGAGGATGGGGCCGAGCCCACCAGCGGCCACCTGCTCTCGGCCGTCACCAACC 

GCAGTGATCTGGGGCCCCCTGCCAGCTCGGCCACCACGCTCGCGGACGGCGGGGAGGGGCAG 

CACGACGGCACATTCGAGCCTGCCACCGTGGCTCTTCCAGGCGGCGAGCACGCCGAGAACGC 

CGTGCAGATCCACAAGGTGGTCACGGGCACCATGGCCCTCATCTTCTCCTTCCTCATCGTGG- 

TCCTGGTGCTCTACGTGTCCTGGAAGTGTTTCCCAGCCAGCCTCAGGCAGCTCAGACAGTGC 

TTTGTCACGCAGCGCAGGAAGCAAAAGCAGAAACAGACCATGCATCAGATGGCTGCCATGTC 

TGCCCAGGAATACTACGTTGATTACAAACCGAACCACATTGAGGGAGCCCTGGTGATCATCA 

ACGAGTATGGCTCGTGTACCTGCCACCAGCAGCCCGCGAGGGAATGCGAGGTGTGATTGTCC 

CAGTGGCTCTCAACCCATGCGCTACCAAATACGCCTGGGCAGCCGGGACGGGCCGGCGGGCA 

CCAGGCTGGGGTCTCCTTGTCTGTGCTCTGATATGCTCCTTGACTGAAACTTTAAGGGGATC 

TCTCCCAGAGACTTGACATTTTAGCTTTATTGTGTCTTAAAAACAAAAGCGAATTAAAACAC 

AACAAAAAACCCCACCCCACAACCTTCAGGACAGTCTATCTTAAATTTCATATGAGAACTCC 

TTCCTCCCTTTGAAGATCTGTCCATATTCAGGAATCTGAGAGTGTAAAAAAGGTGGCCATAA 

GACAGAGAGAGAATAATCGTGCTTTGTTTTATGCTACTCCTCCCACCCTGCCCATGATTAAA 

CATCATGTATGTAGAAGATCTTAAGTCCATACGCATTTCATGAAGAACCATTGGAAAGAGGA 

ATCTGCAATCTGGGAGCTTAAGAGCAAATGATGACCATAGAAAGCTATGTTCTTACTTTGTG 

TGTGTGTCTGTATGTTTCTGCGTTGTGTGTCTTTGTAGGCAAGCAAACGTTGTCTACACAAA 

CGGGAATTTAGCTCACATCATTTCATGCCCCTGTGCCTCTAGCTCTGGAGATTGGTGGGGGG 

AGGTGGGGGGAAACGGCAGGAATAAGGGAAAGTGGTAGTTTTAACTAAGGTTTTGTAACACT 

TGAAATCTTTTCTTTCTCAAATTAATTATCTTTAAGCTTCAAGAAACTTGCTCTGACCCCTC 

TAAGCAAACTACTAAGCATTTAAAAGAGAATCTAATTTTTAAAGGTGTAGCACCTTTTTTTT 

TATT CTT C C C AC AG AG GG TG CT AATCTCATTATG CTG TG CTATCTGAAAAGAACTT AAGG C C 

ACAATTCACGTCTCGTCCTGGGCATTGTGATGGATTGACCCTCCATTTGCAGTACCTTCCCA 

GCTGATTAAAGTTCAGCAGTGGTATTGAGGTTTTTCGAATATTTATATAGAAAAAAAGTCTT 

TTCACATGACAAATGACACTCTCACACCAGTCTTAGCCCTAGTAGTTTTTTAGGTTGGACCA 

GAGGAAGCAGGTTAAATGAGACCTGTCCTCTGCTGCACTCAGAAAAAATAGGCAGTCCCTGA 

TGCTCAGATCTTAGCCTTGATATTAATAGTTGAGACCACCTACCCACAATGCAGCCTATACT 

CCCAAGACTACAAAGTTACCATCGCAAAGGAAAGGTTATTCCAGTAAAAGGAAATAGTTTTC 

TCAACCATTTAAAAATATTCTTCTGAACTCATCAAAGTAGAAGAGCCCCCAACCTTTTCTCT 

CTG C CTT CAAGAAGG CAGACATTTGGTATG AT TTAG CAT CAACAACACATTT ATGAGTATAT 
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FIGURE 195ft 

GTAAGTAATCAGAGGGGCAAATGCCACTTGTTATTCCTCCCAAGTTTTCCAAGCAAGTACAC 

ACAGATCTCTGGTAGGATTAGGGGCCACTTGTGTTTCCGGCTTATTTTAGTCGACTTGTCAG 

CAAGTTTGATGCCTAGTCTATCTGACATGGCCCAGTAGAACAGGGCATTGATGGATCACATG 

AGATGGTAGAAGGAACATCATCACATACCCCTCTCACAGAGAAAATTATCAAAGAACCAGAA 

ATTATATCTGTTTTGGAGCAAGAGTGTCATAATGTTTCAGGGTAGTCT^AAATAAACATAAAT 

TATCTCCTCTAGATGAGTGGCGATGTTGGCTGATTTGGGTCTGCCATTGACAGAATGTCAAA 

TAAAAAGGAATTAGCTAGAATATGACCATTAAATGTGCTTCTGAAATATATTTTGAGATAGG 
TTTAGAATGTCA 
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FIGURE 196 

MDFLLLGLCLYWLLRRPSGVVLCLLGACFQKLPAAPSGCPQLCRCEGRLLYCEALNLTEAPH 
NLSGLLGLSLRYNSLSELRAGQFTGLMQLTWLYLDHNHICSVQGDAFQKLRRVKELTLSSNQ 
ITQLPNTTFRPMPNLRSVDLSYNKLQALAPDLFHGLRKLTTLHMRANAIQFVPVRIFQDCRS 
LKFLDIGYNQLKSLARNSFAGLFKLTELHLEHNDLVK^ 

VSSLDWVWNLEKMDLSGNE I EYME PHVFETVPHLQS LQLDSNRLTYI EPR ILNS WKSLTS I T 
LAGNLWDCGRNVCAIJVSWLSNFQGRYDGNLQCASPEYAQGEDVLDAVYAFHLCEDGAEPTSG 
HLLSAVTNRSDLGPPASSATTIJ^GGEGQHIX3TFEPATVALPGGEHAENAVQIHKVVTGTMA 
LIFSFLIVVLVLYVSWKCFPASLRQLRQCFVTQRRKQKQKQTMHQMAAMSAQEYYVDYKPNH 
IEGALVIINEYGSCTCHQQPARECEV 
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FIGURE 197 

GTGCAAGGAGCCGAGGCGAGATGGGCGTCCTGGGCCGGGTCCTGCTGTGGCTGCAGCTCTGC 
GCACTGACCCAGGCGGTCTCCAAACTCTGGGTCCCCAACACGGACTTCGACGTCGCAGCCAA 
CTGGAGCCAGAACCGGACCCCGTGCGCCGGCGGCGCCGTTGAGTTCCCGGCGGACAAGATGG 
TGTCAGTCCTGGTGCAAGAAGGTCACGCCGTCTCAGACATGCTCCTGCCGCTGGATGGGGAA 
CTCGTCCTGGCTTCAGGAGCCGGATTCGGCGTCTCAGACGTGGGCTCGCACCTGGACTGTGG 
CGCGGGCGAACCTGCCGTCTTCCGCGACTCTGACCGCTTCTCCTGGCATGACCCGCACCTGT 
GGCGCTCTGGGGACGAGGCACCTGGCCTCTTCTTCGTGGACGCCGAGCGCGTGCCCTGCCGC 
CACGACGACGTCTTCTTTCCGCCTAGTGCCTCCTTCCGCGTGGGGCTCGGCCCrGGCGCTAG 
CCCCGTGCGTGTCCGCAGCATCTCGGCTCTGGGCCGGACGTTCACGCGCGACGAGGACCTGG 
CTGTTTTCCTGGCGTCCCGCGCGGGCCGCCTACGCTTCCACGGGCCGGGCGCGCTGAGCGTG 
GGCCCCGAGGACTGCGCGGACCCGTCGGGCTGCGTCTGCGGCAACGCGGAGGCGCAGCCGTG 
GATCTGCGCGGCCCTGCTCCAGCCCCT 



WO 99/63088 ^ ^ 2.^3 ^ D PCT/US99/12252 

FIGURE 198 

MGVLGRVLLWLQLCALTQAVSKLOTPNTDFDVAAN^ 

GHAVSDMLLPLIX2ELVLASGAGFGVSDVGSHLDCGAGEPAVFRDSDRFSWHDPHLWRSGDEA 
PGLFFVDAERVPCRHDDVFFPPSASFRVGLGPGASPVRVRSISALGRTFTRDEDLAVFLASR 
AGRLRFHGPGALSVGPEDCADPSGCVCGNAEAQPWICAALLQP 
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FIGURE 199 

atcgcatcaattgggagtaccatcttcctcatgggaccagtgaaacagctgaagcgaatgtt 

tgagcctactcgtttgattgcaactatcatggtgctgttgtgttttgcacttaccctgtgtt 

ctgccttttggtggcataacaagggacttgcacttatcttctgcattttgcagtctttggca 

ttgacgt ggtacag c cttt c ctt cat ac catttg caaggg atg ctgtg aagaagtgttttg c 

cgtgtgtcttg»taattcatggccagttttatgaagctttggaaggcactatggacagaag 

ctggtggacagttttgtaactatcttcgaaacctctgtcttacagacatgtgccttttatct 

tgcagcaatgtgttgcttgtgattcgaacatttgagggttacttttggaagcaacaatacat 

tctcgaacctgaatgtcagtagcacaggatgagaagtgggttctgtatcttgtggagtggaa 

tcttcctcatgtacctgtttcctctctggatgttgtcccactgaattcccatgaatacaaac 

ctattcagcaacagc^aaaaaaaaaaaaaaaaaaaaaaaaaaa^^ 

aaaaaaaaaaaaaaaaaaaaaaaa 
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FIGURE 200 

MGPVKQLKIWFEPTRLIATIMVLLCFA^ 
FARDAVKKCFAVCLA 
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FIGURE 201 

TTGAGCGCAGGTGAGCTCCTGCGCGTTCCGGGGGCGTTCCTCCAGTCACCCTCCCGCCGTTA 
CCCGCGGCGCGCCCGAGGGAGTCTCCTCCAGACCCTCCCTCCCGTTGCTCCAAACTAATACG 
GACTGAACGGATCGCTGCGAGGGTGGGAGAGAAAATTAGGGGGAGAAAGGACAGAGAGAGCA 
ACTACC^TCCATAGCCAGATAGATTATCTTACACTGAACTGATCAAGTACTTTGAAAATGAC 
TTCGAAATTTATCTTGGTGTCCTTCATACTTGCTGCACTGAGTCTTTCAACCACCTTTTCTC 
TCCAACTAGACCAGCAAAAGGTTCTACTAGTTTCTTTTGATGGATTCCGTTGGGATTACTTA 
TATAAAGTTCCAACGCCCCATTTTCATTATATTATGAAATATGGTGTTCACGTGAAGCAAGT 
TACTAATGTTTTTATTACAAAAACCTACCCTAACCATTATACTTTGGTAACTGGCCTCTTTG 
CAGAGAATCATGGGATTGTTGCAAATGATATGTTTGATCCTATTCGGAACAAATCTTTCTCC 
TTGGATCACATGAATATTTATGATTCCAAGTTTTGGGAAGAAGCGACACCAATATGGATCAC 
AAACCAGAGGGCAGGACATACTAGTGGTGCAGCCATGTGGCCCGGAACAGATGTAAAAATAC 
ATAAGCGCTTTCCTACTCATTACATGCCTTAC^TGAGTCAGTTTCATTTGAAGATAGAGTT 
GCCAAAATTGTTGAATGGTTTACGTCAAAAGAGCCCATAAATCTTGGTCTTCTCTATTGGGA 
AGACCCTGATGACATGGGCCACCATTTGGGACCTGACAGTCCGCTCATGGGGCCTGTCATTT 
CAGAT ATT GACAAGAAGTTAGGATATCT CATACAAATG CTGAAAAAGG CAAAGTTGTGGAAC 
ACTCTGAACCTAATCATCACAAGTGATCATGGAATGACGCAGTGCTCTGAGGAAAGGTTAAT 
AGAACTTGAC CAGTAC CTGGATAAAGACCACTATAC CCTGATTG ATCAAT CTCCAG TAGCAG 
CCATCTTGCCAAAAGAAGGTAAATTTGATGAAGTCTATGAAGCACTAACTCACGCTCATCCT 
AATCTTACTGTTTAC^^AAAAGAAGACGTTCCAGAAAGGTGG(^TT^ 

AATTCAACCAATCATAGCAGTGGCTGATGAAGGGTGGCACATTTTACAGAATAAGTCAGATG 
ACTTTCTGTTAGGCAACCACGGTTACGATAATGCGTTAGCAGATATGCATCCAATATTTTTA 
GCCCATGGTCCTGCCTTCAGAAAGAATTTCTCAAAAGAAGCCATGAACTCCACAGATTTGTA 
CCCACTACTATGCCACCTCCTCAATATCACTGCCATGCCACACAATGGATCATTCTGGAATG 
T C CAGG AT CT G CTCAATT CAG CAATG C CAAGGGT GGT C C CTTAT ACACAGAGTACTAT ACT C 
CTCCCTGGTAGTGTTAAACCAGCAGAATATGACCAAGAGGGGTCATACCCTTATTTCATAGG 
GGTCTCTCTTGGCAGCATTATAGTGATTGTATTTTTTGTAATTTTCATTAAGCATTTAATTC 
ACAGTCAAATACCTGCCTTACAAGATATGCATGCTGAAATAGCTCAACCATTATTACAAGCC 
TAATGTTACTTTGAAGTGGATTTGCATATTGAAGTGGAGATTCCATAATTATGTCAGTGTTT 
AAAGGTTTCAAATTCTGGGAAACCAGTTCCAAACATCTGCAGAAACCATTAAGCAGTTACAT 
ATTTAGGTATACACACACACACACACACACACATACACACACACGGACCAAAATACTTACAC 
CTGCAAAGGAATAAAGATGTGAGAGTATGTCTCCATTGTTCACTGTAGCATAGGGATAGATA 
AGATCCTGCTTTATTTGGACTTGGCGCAGATAATGTATATATTTAGCAACTTTGCACTATGT 
AAAGTACCTTATATATTGCACTTTAAATTTCTCTCCTGATGGGTACTTTAATTTGAAATGCA 
CTTTATGGACAGTTATGTCTTATAACTTGATTGAAAATGACAACTTTTTGCACCCATGTCAC 
AGAATACTTGTTACGCATTGTTCAAACTGAAGGAAATTTCTAATAATCCCGAATAATGAACA 
TAGAAATCTATCTCCATAAATTGAGAGAAGAAGAAGGTGATAAGTGTTGAAAATTAAATGTG 
ATAACCTTTGAACCTTGAATTTTGGAGATGTATTCCCAACAGCAGAATGCAACTGTGGGCAT 
TTCTTGTCTTATTTCTTTCCAGAGAACGTGGTTTTCATTTATTTTTCCCTCAAAAGAGAGTC 
AAATACTGACAGATTCGTTCTAAATATATTGTTTCTGTCATAAAATTATTGTGATTTCCTGA 
TGAGTCATATTACTGTGATTTTCATAATAATGAAGACACCATGAATATACTTTTCTTCTATA 
TAGTTCAGCAATGGCCTGAATAGAAGCAACCAGGCACCATCTCAGCAATGTTTTCTCTTGTT 
TGTAATTATT TG CT CCTTTGAAAATT AAATCACTATTAATTACATTAAAAATCAAATTGGAT 
AAAAAAAAAAAAAAAAAAA 
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FIGURE 202 

MTS KF I LVS F I LlAALS LSTTFS LQLDQQKVLLVS FDGFRWD YLYKVPTPHFHY I MKYGVHVK 
QVTNVFITKTYPNHYTLVTGLFAENHGIVANDMFDPIRNKSFSLDHMNIYDSKFWEEATPIW 
ITNQRAGHTSGAAMWPGTDVKIHKRFPTHYMPYNESVSFEDRVAKIVEWFTSKEPINLGLLY 
WEDPDDMGHHLGPDSPLMGPVISDIDKKLGYLIQMLKKAKLV^LNLIITSDHGMTQCSEER 
LIELIX3YLDKDHYTLIDQSPVAAILPKEGKFDEVYEALTHAHPNLTVYKKEDVPERWHYKYN 
SR I QP 1 1 AVADEGWHI LQNKSDDFLLGNHG YDNALADMHP I FLAHGPAFRKNFSKEAMNS TD 
LYPLLCHLLNITAMPHNGSFWNVQDLLNSAMPRVVPYTQSTILLPGSVKPAEYDQEGSYPYF 
IGVSLGSIIVIVFFVIFIKHLIHSQIPALQDMHAEIAQPLLQA 
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FIGURE 203 

GGATTTTTGTGATCCGCGATTCGCTCCCACGGGCGGGACCTTTGTAACTGCGGGAGGCCCAG 

GACAGGCCCACCCTGCGGGGCGGGAGGCAGCCGGGGTGAGGGAGGTGAAGAAACCAAGACGC 

AGAGAGGCCAAGCCCCTTGCCTTGGGTCACACAGCCAAAGGAGGCAGAGCCAGAACTCACAA 

CCAGATCCAGAGGCAACAGGGACATGGCCACCTGGGACGAAAAGGCAGTCACCCGCAGGGCC 

AAGGTGGCTCCCGCTGAGAGGATGAGCAAGTTCTTAAGGCACTTCACGGTCGTGGGAGACGA 

CTACCATGCCTGGAACATCAACTAC71AGAAATGGGAGAATGAAGAGGAGGAGGAGGAGGAGG 

AGCAGCCACCACCCACACCAGTCTCAGGCGAGGAAGGCAGAGCTGCAGCCCCTGACGTTGCC 

CCTGCCCCTGGCCCCGCACCCAGGGCCCCCCTTGACTTCAGGGGCATGTTGAG3AAACTGTT 

CAGCTCCCACA.GGTTTCAGGTCATCATCATCTGCTTGGTGGTTCTGGATGCCCTCCTGGTGC 

TTGCTGAGCTCATCCTGGACCTGAAGATCATCCAGCCCGACAAGAATAACTATGCTGCCATG 

GTATTCCACTACATGAGCATCACCATCTTGGTCTTTTTTATGATGGAGATCATCTTTAAATT 

ATTTGTCTTCCGCCTGAGTTCTTTCACCACAAGTTTGAGATCCTGGATGCCCGTCGTGGTGG 

TGGTCTCATTCATCCTGGACATTGTCCTCCTGTTCCAGGAGCACCAGTTTGAGGCTCTGGGC 

CTGCTGATTCTGCTCCGGCTGTGGCGGGTGGCCCGGATCATCAATGGGATTATCATCTCAGT 

TAAGACACGTTCAGAACGGCAACTCTTAAGGTTAAAACAGATGAATGTACAATTGGCCGCCA 

AGATTCAACACCTTGAGTTCAGCTGCTCTGAGAAGCCCCTGGACTGATGAGTTTGCTGTATC 

AACCTGTAAGGAGAAGCTCTCTCCGGATGGCTATGGGAATGAAAGAATCCGACTTCTACTCT 

CACACAGCCACCGTGAAAGTCCTGGAGTAAAATGTGCTGTGTACAGAAGAGAGAGAAGGAAG 

CAGGCTGGCATGTTCACTGGGCTGGTGTTACGACAGAGAACCTGACAGTCACTGGCCAGTTA 

T C AC TT CAGATT ACAAAT C A CACAGAGC AT CTG C CTGT TTTCAATCACAAGAG AACAAAACC 

AAAATCTATAAAGATATTCTGAAAATATGACAGAATTTGACAAATAAAAGCATAAACGTGTA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 204 

MATWDEKAVTRRACTAPAERMSKFLRHFTW^ 

SGEEGRAAAPDVAPAPGPAPRAPLDFRGMLRKLFSSHRFQVI 1 1 CLVVLD ALLVLAEL I LDL 
K 1 1 QPD KNNYAAMVFHYMS I TI LVFFMME 1 1 F KL FVFRLS S FTTSLRS WMPVWVVS F I LD I 
VLLFQEHQFEALGLLILLRLWRVARIINGIIISVKTRSERQLIJILKQMNVQLAAKIQHLEFS 
CSEKPLD 
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FIGURE 205 

CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCC^UVCAT 
GCTCTGTCTGTGCCTGTACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTG 
AGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCCATTTTCAAGCTCAGTGTCTTCATCCCCTCC 
CAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCTGGAGATAAGGACCT 
TGATGGGCAGCTAGACTTTGAAGAATTTGTCCATTATCTCCAAGATCATGAGAAGAAGCTGA 
GGCTGGTGTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGATCATG 
CAGTCCCTGCGGGACTTGGGAGTCAAGATATCTGAACAGCAGGCAGAAAAAATTCTCAAGAG 
CATGGATAAAAACGGCACGATGACCATCGACTGGAACGAGTGGAGAGACTACCACCTCCTCC 
ACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACGATCTTTGATGTG 
GGTGAGAATCTAACGGTCCCGGATGAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTG 
GAGACACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGG 
ACAGGCTCAAGGTGCTCATGCAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGT 
GGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGGTCACTCTGGCGGGGCAATGGCATCAA 
CGTCCTCAAAATTGCCCCCGAATCAGCCATCAAATTCATGGCCTATGAGCAGATCAAGCGCC 
TTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCA 
GGGGCCATCGCCCAGAGCAGCATCTACCCAATGGAGGTCCTGAAGACCCGGATGGCGCTGCG 
GAAGACAGGCCAGTACTCAGGAATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGG 
TGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTGGGCATCATCCCCTATGCCGGCATC 
GACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTATGCAGTGAACAGCGC 
GGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGG 
CCAGCTACCCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCT 
CCGGAGGTGACCATGAGCAGCCTCTTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCT 
GTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCAGCTGTGAGCATCAGCTACGTGG 
TCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGCCGCCCG 
GCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATG 
TGCCAACACTAAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGT 
GGGGAGAGCTGGCAGGCCCAGGGCTTGTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCC 
AGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGGCTCCGGGCTCACATGTGTAA 
GGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCTTAGT 
TCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGGCGTGC 
ATCTCCCTGTGCCCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAG 
CCCACATCCCACCCCCTCGTCCAATCCCATAATCCATGATGAAAGGTGAGGTCACGTGGCCT 
CCCAGGCCTGACTTCCCAACCTACAGCATTGACGCC7VACTTGGCTGTGAAGGAAGAGGAAAG 
GATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATGCT 
TGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCA 
TGGTGCTCTGAGCTGGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTG 
TCCCCACTGTGGCATGAGGGCAGTGGAGCACCATGTTTGAGGGCGAAGGGCAGAGCGTTTGT 
GTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGGACTGTTGGGAAAAGGG 
TTTTGTCCAGAAGGACAAGCCGGACAAATGAGCGACTTCTGTGCTTCCAGAGGAAGACGAGG 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACC 
CCAGCAGGGGCGCAGCGGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTT 
ATTTTGTATTTATTTGAACAGAGTTATGTCCTAACTATTTTTATAGATTTGTTTAATTAATA 
GCTTGTCATTTTCAAGTTCATTTTTTATTCATATTTATGTTCATGGTTGATTGTACCTTCCC 
AAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACAT 
CTGTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTG 
GCTCCTTTCCTTTGGCAGGTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGA 
CTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAATAACCTTGAAGGTGGAATCCAGTTATTTC 
CTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAGTGAGGTGCCTCT 
CACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCC 
CAGCCTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCC 
ATTCCACCAGAATGACCTGATGAGGAAATCTTCAATAGGATGCAAAGATCAATGCAAAAATT 
GTTATATATGAACATATAACTGGAGTCGTCAAAAAGCAAATTAAGAAAGAATTGGACGTTAG 
AAGTTGTCATTTAAAGCAGCCTTCTAATAAAGTTGTTTCAAAGCTGAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 206 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKS IFKLSVFI PSQEFSTYRQWKQKIVQAGDKD 
LDGQLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILK 
SMDKNGTMTIDWNEWRDYHLLHPVENI PE 1 1 LYWKHST I FDVGENLTVPDEFTVEERQTGMW 
WRHLVAGGGAGAVSRTCTAPLDRLKVLMQVHASRSNNMG I VGGFTQMI REGGARSLWRGNG I 
NVLKIAPESAIKFMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMAL 
RKTGQYSGMLDCARRI LAREGVAAFYKGYVPNMLGI I P Y AG I DLA VYE T L KN AWL Q HY A VN S 

ADPGVFVLIACGTMSSTCGQLASYPLALVRTRMQAQASIEGAPEVTMSSLFKH 
LYRGLAPNFMKVI PAVS I S YWTENLKI TLGVQSR 
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FIGURE 207 

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCAT 
GGCTTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAG 
CAATTGCACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTC 
GCCTCAGCTGGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAA 
ACTTTCTGATATCGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCA 
AAGAAGGCAAAGATGAGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTT 
GCTGATCAAGTGATAGTTGGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGC 
TGGCACCTACAAATGTTATATCATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATA 
AAACTGGAGCCTTCAGCATGCCGGAAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTG 
CGGTGTGAGGCTCCCCGATGGTTCCCCCAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCA 
GGGAGCCAACTTCTCGGAAGTCTCCAATACCAGCTTTGAGCTGAACTCTGAGAATGTGACCA 
TGAAGGTTGTGTCTGTGCTCTACAATGTTACGATCAACAACACATACTCCTGTATGATTGAA 
AATGACATTGCCAAAGCAACAGGGGATATCAAAGTGACAGAATCGGAGATCAAAAGGCGGAG 
TCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGTCTCTTCTTTCTTTGCCATCAGCT 
GGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAATGTGCCTTGGCCACAAAAAAG 
CATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCACCACCAGATATGACCTAG 
TTTT ATATTT CTGGGAGGAAATGAATTCATAT CTAGAAGT CTGGAG TGAG CAAACAAGAGCA 
AGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAATCTATCTTCAAA 
GACATATTAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGAGTGATAAG 
TAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGTCACCT 
GGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGCTG 
TAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCA 
CAAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGG 
GGCGGCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCT 
TGGCTTCTCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGCATAA 
ACAGAGCAGTCGGGGACACCGATTTTATAAATAAACTGAGCACCTTCTTTTTAAACAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 208 

MASLGQILFWSIISIIIIIoAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDI 

KLSDIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTD 

AGTYKCYIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTVWASQTO 

QGANFSEVSNTSFEUtfSENVTMKWSVLYNVTI^ 

SHLQLLNSKASLCVSSFFAISWALLPLSPYLMLK 
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FI GURE 20 9 

GAATTTGTAGAAGACAGCGGCGTTGCCATGGCGGCGTCTCTGGGGCAGGTGTTGGCTCTGGT 
GCTGGTGGCCGCTCTGTGGGGTGGCACGCAGCCGCTGCTGAAGCGGGCCTCCGCCGGCCTGC 
AGCGGGTTCATGAGCCGACCTGGGCCCAGCAGTTGCTACAGGAGATGAAGACCCTCTTCTTG 
AATACTGAGTACCTGATGCCCTTTCTCCTCAACCAGTGTGGATCCCTTCTCTATTACCTCAC 
CTTGGCATCGACAGATCTGACCCTGGCTGTGCCCATCTGTAACTCTCTGGCTATCATCTTCA 
CACTGATTGTTGGGAAGGCCCTTGGAGAAGATATTGGTGGAAAACGTAAGTTAGACTACTGC 
GAGTGCGGGACGCAGCTCTGTGGATCTCGACATACCTGTGTTAGTTCCTTCCCAGAACCCAT 
CTCCCCAGAGTGGGTGAGGACACGGCCTTTTCCCATCCTGCCCTTTCCTCTGCAGCTGTTTT 
GCTTCCTTGTGGCCATCAGAGTTCCCTTCCCCTGGACAGTCTGGAGAAAGACAGAGGCTGGG 
GTTTGGGATTGAAGACCAGACCCCATCTGAGCCCTTCCTCCAGCCCTGTACCAGCTCCTACT 
GGCATGGCTGAGCTCAGACCCTCCTGATTTOTGCCTATTATCCCAGGAGCAGTTGCTGGCAT 
GGTGCTCACCGTGATAGGAATTTCACTCTGCATCACAAGCTCAGTGAGTAAGACCCAGGGGC 
AACAGTCTACCCTTTGAGTGGGCCGAACCCACTTCCAGCTCTGCTGCCTCCAGGAAGCCCCT 
GGGCCATGAAGTGCTGGCAGTGAGCGGATGGACCTAGCACTTCCCCTCTCTGGCCTTAGCTT 
CCTCCTCTCTTATGGGGATAACAGCTACCTCATGGATCACAATAAGAGAACAAGAGTGAAAG 
AGTTTTGTAACCTTCAAGTGCTGTTCAGCTGCGGGGATTTAGCACAGGAGACTCTACGCTCA 
CCCTCAGCAACCTTTCTGCCCCAGCAGCTCTCTTCCTGCTAACATCTCAGGCTCCCAGCCCA 
GCCACCATTACTGTGGCCTGATCTGGACTATCATGGTGGCAGGTTCCATGGACTGCAGAACT 
CCAGCTGCATGGAAAGGGCCAGCTGCAGACTTTGAGCCAGAAATGCAAACGGGAGGCCTCTG 
GGACTCAGTCAGAGCGCTTTGGCTGAATGAGGGGTGGAACCGAGGGAAGAAGGTGCGTCGGA 
GTGGCAGATGCAGGAAATGAGCTGTCTATTAGCCTTGCCTGCCCCACCCATGAGGTAGGCAG 
AAATCCTCACTGCCAGCCCCTCTTAAACAGGTAGAGAGCTGTGAGCCCCAGCCCCACCTGAC 
T C CAG C AC AC CTGG CG AG T AGT AG CTG TC AATAAATCT ATGTAAAC AG AC AAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 210 

MAASLGQVIALVLVAALWGGTQPLL 

L^QCGSLLYYLTLASTDLTLAVPI CNSLAI IFTL IVGKALGEDI GGKRKLDYCECGTQLiCGS 
RHTCVSSFPEPISPEWVRTRPFPILPFPLQLFCFLVAIRVPFPWTVWRKTEAGVWD 
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FIGURE 211 

CTTCTGTAGGACAGTCACCAGGCCAGATCCAGAAGCCTCTCTAGGCTCCAGCTTTCTCTGTG 

GAAGATGACAGCAATTATAGCAGGAC CCTG C C AGGCTG TC GAAAAG ATT CCG CAATAAAACT 

TTGCCAGTGGGAAGTACCTAGTGAAACGGCCTAAGATGCCACTTCTTCTCATGTCCCAGGCT 

TGAGGCCCTGTGGTCCCCATCCTTGGGAGAAGTCAGCTCCAGCACC^TGAAGGGCATCCTCG 

TTGCTGGTATCACTGCAGTGCTTGTTGCAGCTGTAGAATCTCTGAGCTGCGTGCAGTGTAAT 

TCATGGGAAAAATCCTGTGTCAACAGCATTGCCTCTGAATGTCCCTCACATGCCAACACCAG 

CTGTATCAGCTCCTCAGCCAGCTCCTCTCTAGAGACACCAGTCAGATTATACCAGAATATGT 

TCTGCTCAGCGGAGAACTGCAGTGAGGAGACACACATTACAGCCTTCACTGTCCACGTGTCT 

GCTGAAGAACACTTTCATTTTGTAAGCCAGTGCTGCCAAGGAAAGGAATGCAGCAACACCAG 

CGATGCCCTGGACCCTCCCCTGAAGAACGTGTCCAGCAACGCAGAGTGCCCTGCTTGTTATG 

AATCTAATGGAACTTCCTGTCGTGGGAAGCCCTGGAAATGCTATGAAGAAGAACAGTGTGTC 

TTTCTAGTTGCAGAACTTAAGAATGACT^TTGAGTCTAAGAGTCTCGTGCTGAAAGGCTGTTC 

CAACGTCAGTAACGCCACCTGTCAGTTCCTGTCTGGTGAAAACAAGACTCTTGGAGGAGTCA 

TCTTTCGAAAGTTTGAGTGTGCAAATGTAAACAGCTTAACCCCCACGTCTGCACCAACCACT 

TCCCACAACGTGGGCTCCAAAGCTTCCCTCTACCTCTTGGCCCTTGCCAGCCTCCTTCTTCG 

GGGACTGCTGCCCTGAGGTCCTGGGGCTGCACTTTGCCCAGCACCCCATTTCTGCTTCTCTG 

AGGTCCAGAGCACCCCCTGCGGTGCTGACACCCTCTTTCCCTGCTCTGCCCCGTTTAACTGC 

CCAGTAAGTGGGAGTCACAGGTCTCCAGGCAATGCCGACAGCTGCCTTGTTCTTCATTATTA 

AAGCACTGGTTCATTCACTGCCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 212 

MKGILVAGITAVLVAAVESLSCVQCNSWEKSCVNSIASECPSHANTSCISSSASSSLETPVR 
LYQNMFCSAENCSEETKITAFTVHVSAEEHFHFVSQCCQGKECSNTSDALDPPLKNVSSNAE 
CPACOTSNGTSCRGKPWKCYEEEQCVFLVAELKNDIESKSLVLKGCSNVSNATCQFLSGENK 
TLGGVIFRKFECANVNSLTPTSAPTTSHNVGSKASLYLLALASLLLRGLLP 
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FIGURE 213 

GGCCTCGGTTCAAACGACCCGGTGGGTCTACAGCGGAAGGGAGGGAGCGAAGGTAGGAGGCA 

GGGCTTGCCTCACTGGCCACCCTCCCAACCCCAAGAGCCCAGCCCCATGGTCCCCGCCGCCG 

GCGCGCTGCTGTGGGTCCTGCTGCTGAATCTGGGTCCCCGGGCGGCGGGGGCCCAAGGCCTG 

ACCCAGACTCCGACCGAAATGCAGCGGGTCAGTTTACGCTTTGGGGGCCCCATGACCCGCAG 

CTACCGGAGCACCGCCCGGACTGGTCTTCCCCGGAAGACAAGGATAATCCTAGAGGACGAGA 

ATGATGCCATGGCCGACGCCGACCGCCTGGCTGGACCAGCGGCTGCCGAGCTCTTGGCCGCC 

ACGGTGTCCACCGGCTTTAGCCGGTCGTCCGCCATTAACGAGGAGGATGGGTCTTCAGAAGA 

GGGGGTTGTGATTAATGCCGGAAAGGATAGCACCAGCAGAGAGCTTCCCAGTGCGACTCCCA 

ATACAGCGGGGAGTTCCAGCACGAGGTTTATAGCCAATAGTCAGGAGCCTGAAATCAGGCTG 

ACTTCAAGCCTGCCGCGCTCCCCCGGGAGGTCTACTGAGGACCTGCCAGGCTCGCAGGCCAC 

CCTGAGCCAGTGGTCCACACCTGGGTCTACCCCGAGCCGGTGGCCGTCACCCTCACCCACAG 

CCATGCCATCTCCTGAGGATCTGCGGCTGGTGCTGATGCCCTGGGGCCCGTGGCACTGCCAC 

TGCAAGTCGGGCACCATGAGCCGGAGCCGGTCTGGGAAGCTGCACGGCCTTTCCGGGCGCCT 

TCGAGTTGGGGCGCTGAGCCAGCTCCGCACGGAGCACAAGCCTTGCACCTATCAACAATGTC 

CCTGCAACCGACTTCGGGAAGAGTGCCCCCTGGACACAAGTCTCTGTACTGACACCAACTGT 

GCCTCTCAGAGCACCACCAGTACCAGGACCACCACTACCCCCTTCCCCACCATCCACCTCAG 

AAGCAGTCCCAGCCTGCCACCCGCCAGCCCCTGCCCAGCCCTGGCTTTTTGGAAACGGGTCA 

GGATTGGCCTGGAGGATATTTGGAATAGCCTCTCTTCAGTGTTCACAGAGATGCAACCAATA 

GACAGAAACCAGAGGTAATGGCCACTTCATCCACATGAGGAGATGTCAGTATCTCAACCTCT 

CTTGCCCTTTCAATCCTAGCACCCACTAGATATTTTTAGTACAGAAAAACAAAACTGGAAAA 
CACAA 
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FIGURE 214 

WPAAGALLWVLLLNLGPRAAGAQGLTQTPTEMQRVSLRFGGPMTRSYRSTARTGLPRKTRI 
ILEDENDAMADADRLAGPAAAELLAATVSTGFSRSSAINEEDGSSEEGWINAGKDSTSREL 
PSATPNTAGSSSTRFIANSQEPEIRLTSSLPRSPGRSTEDLPGSQATLSQWSTPGSTPSRWP 
SPSPTAMPSPEDLRLVLMPWGPWHCHCKSGTMSRSRSGKLHGLSGRLRVGALSQLRTEHKPC 
TYQQCPCNRLREECPLDTSLCTDTNCASQSTTSTRTTTTPFPTIHLRSSPSLPPASPCPALA 
FWKRVRIGLEDIWNSLSSVFTEMQPIDRNQR 
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FIGURE 215 

CCCGGGTCGACCCACGCGTCCGGGGAGAAAGGATGGCCGGCCTGGCGGCGCGGTTGGTCCTG 
CTAGCTGGGGCAGCGGCGCTGGCGAGCGGCTCCCAGGGCGACCGTGAGCCGGTGTACCGCGA 
CTGCGTACTGCAGTGCGAAGAGCAGAACTGCTCTGGGGGCGCTCTGAATCACTTCCGCTCCC 
GCCAGCCAATCTACATGAGTCTAGCAGGCTGGACCTGTCGGGACGACTGTAAGTATGAGTGT 
ATGTGGGTCACCGTTGGGCTCTACCTCCAGGAAGGTCACAAAGTGCCTCAGTTCCATGGCAA 
GTGGCCCTTCTCCCGGTTCCTGTTCTTTCAAGAGCCGGCATCGGCCGTGGCCTCGTTTCTCA 
ATGGCCTGGCCAGCCTGGTGATGCTCTGCCGCTACCGCACCTTCGTGCCAGCCTCCTCCCCC 
ATGTACCACACCTGTGTGGCCTTCGCCTGGGTGTCCCTCAATGCATGGTTCTGGTCCACAGT 
CTTCCACACCAGGGACACTGACCTCACAGAGAAAATGGACTACTTCTGTGCCTCCACTGTCA 
TCCTACACTCAATCTACCTGTGCTGCGTCAGGACCGTGGGGCTGCAGCACCCAGCTGTGGTC 
AGTGCCTTCCGGGCTCTCCTGCTGCTCATGCTGACCGTGCACGTCTCCTACCTGAGCCTCAT 
CCGCTTCGACTATGGCTACAACCTGGTGGCCAACGTGGCTATTGGCCTGGTCAACGTGGTGT 
GGTGGCTGGCCTGGTGCCTGTGGAACCAGCGGCGGCTGCCTCACGTGCGCAAGTGCGTGGTG 
GTGGTCTTGCTGCTGCAGGGGCTGTCCCTGCTCGAGCTGCTTGACTTCCCACCGCTCTTCTG 
GGTCCTGGATGCCCATGCCATCTGGCACATCAGCACCATCCCTGTCCACGTCCTCTTTTTCA 
G C TTTC TGGAAG ATGACAG CCTG TAC CTG CTGAAGGAAT CAG AGG ACAAGTT CAAG CTGG A C 
TGAAGACCTTGGAGCGAGTCTGCCCCAGTGGGGATCCTGCCCCCGCCCTGCTGGCCTCCCTT 
CTCCCCTCAACCCTTGAGATGATTTTCTCTTTTCAACTTCTTGAACTTGGACATGAAGGATG 
TGGGCCCAGAATCATGTGGCCAGCCCACCCCCTGTTGGCCCTCACCAGCCTTGGAGTCTGTT 
CTAGGGAAGGCCTCCCAGCATCTGGGACTCGAGAGTGGGCAGCCCCTCTACCTCCTGGAGCT 
GAACTGGGGTGGAACTGAGTGTGTTCTTAGCTCTACCGGGAGGACAGCTGCCTGTTTCCTCC 
CCACCAGCCTCCTCCCCACATCCCCAGCTGCCTGGCTGGGTCCTGAAGCCCTCTGTCTACCT 
GGGAGACCAGGGACCACAGGCCTTAGGGATACAGGGGGTCCCCTTCTGTTACCACCCCCCAC 
CCTCCTCCAGGACACCACTAGGTGGTGCTGGATGCTTGTTCTTTGGCCAGCCAAGGTTCACG 
GCGATTCTCCCCATGGGATCTTGAGGGACCAAGCTGCTGGGATTGGGAAGGAGTTTCACCCT 
GACCGTTGCCCTAGCCAGGTTCCCAGGAGGCCTCACCATACTCCCTTTCAGGGCCAGGGCTC 
CAGCAAGCCCAGGGCAAGGATCCTGTGCTGCTGTCTGGTTGAGAGCCTGCCACCGTGTGTCG 
GGAGTGTGGGCCAGGCTGAGTGCATAGGTGACAGGGCCGTGAGCATGGGCCTGGGTGTGTGT 
GAGCTCAGGCCTAGGTGCGCAGTGTGGAGACGGGTGTTGTCGGGGAAGAGGTGTGGCTTCAA 
AGTGTGTGTGTGCAGGGGGTGGGTGTGTTAGCGTGGGTTAGGGGAACGTGTGTGCGCGTGCT 
GGTGGGCATGTGAGATGAGTGACTGCCGGTGAATGTGTCCACAGTTGAGAGGTTGGAGCAGG 
ATGAGGGAATCCTGTCACCATCAATAATCACTTGTGGAGCGCCAGCTCTGCCCAAGACGCCA 
CCTGGGCGGACAGCCAGGAGCTCTCCATGGCCAGGCTGCCTGTGTGCATGTTCCCTGTCTGG 
TGCCCCTTTGCCCGCCTCCTGCAAACCTCACAGGGTCCCCACACAACAGTGCCCTCCAGAAG 
CAGCCCCTCGGAGGCAGAGGAAGGAAAATGGGGATGGCTGGGGCTCTCTCCATCCTCCTTTT 
CTCCTTGCCTTCGCATGGCTGGCCTTCCCCTCCAAAACCTCCATTCCCCTGCTGCCAGCCCC 
TTTGCCATAGCCTGATTTTGGGGAGGAGGAAGGGGCGATTTGAGGGAGAAGGGGAGAAAGCT 
TATGGCTGGGTCTGGTTTCTTCCCTTCCCAGAGGGTCTTACTGTTCCAGGGTGGCCCCAGGG 
CAGGCAGGGGCCACACTATGCCTGTGCCCTGGTAAAGGTGACCCCTGCCATTTACCAGCAGC 
CCTGGCATGTTCCTGCCCCACAGGAATAGAATGGAGGGAGCTCCAGAAACTTTCCATCCCAA 
AGGCAGTCTCCGTGG7TGAAGCAGACTGGATTTTTGCTCTGCCCCTGACCCCTTGTCCCTCT 
TTGAGGGAGGGGAGCTATGCTAGGACTCCAACCTCAGGGACTCGGGTGGCCTGCGCTAGCTT 
CTTTTGATACTGAAAACTTTTAAGGTGGGAGGGTGGCAAGGGATGTGCTTAATAAATCAATT 
CCAAGCCTCAAAAAAAAAAAAAAAAA 
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FIGURE 216 

MAGLAARLVLLAGAAA1ASGSQ 

TCRDDCKYECMWVTVGLYLQSGHKVPQFHGKWPFSRFLFFQEPASAVASFLNGLASLVMLCR 
YRTFVPASSPMYHTCVAFAWSLNAWFWSTVFHTRDTDLTEKMDYFCASTVI LHS I YLCCVR 
TVGLQHPAWSAFRALLLLMLTVHVSYLSLIRFDYGYNL^^ 

RLPHVRKCWWLLLQGLSLLELLDFPPLFWVLDAHAIWHISTIPVHVLFFSFLEDDSLYLL 
KESEDKFKLD 



FIGURE 217 

GGCCGCCTGGAATTGTGGGAGTTGTGTCTGCCACTCGGCTGCCGGAGGCCGAAGGTCCGTGA 
CTATGGCTCCCCAGAGCCTGCCTTCATCTAGGATGGCTCCTCTGGGCATGCTGCTTGGGCTG 
CTGATGGCCGCCTGCTTCACCTTCTGCCTCAGTCATCAGAACCTGAAGGAGTTTGCCCTGAC 
CAACCCAGAGAAGAGCAGCACCAAAGAAACGGAGAGAAAAGAAACCAAAGCCGAGGAGGAGC 
TGGATGCCGAAGTCCTGGAGGTGTTCCACCCGACGCATGAGTGGCAGGCCCTTCAGCCAGGG 
CAGGCTGTCCCTGCAGGATCCCACGTACGGCTGAATCTTCAGACTGGGGAAAGAGAGGCAAA 
ACTCCAATATGAGGACAAGTTCCGAAATAATTTGAAAGGCAAAAGGCTGGATATCAACACCA 
ACACCTACACATCTCAGGATCTCAAGAGTGCACTGGCAAAATTCAAGGAGGGGGCAGAGATG 

G AG AGTT C AAAGG AAG AC AAGG CAAGG C AGGCTG AGGT AAAG CG GCT CTT C CG C CC CATTG A 
GGAACTGAAGAAAGACTTTGATGAGCTGAATGTTGTCATTGAGACTGACATGCAGATCATGG 
TACGGCTGATCAACAAGTTCAATAGTTCCAGCTCCAGTTTGGAAGAGAAGATTGCTGCGCTC 
TTTGATCTTGAATATTATGTCCATCAGATGGACAATGCGCAGGACCTGCTTTCCTTTGGTGG 
TCTTCAAGTGGTGATCAATGGGCTGAACAGCACAGAGCCCCTCGTGAAGGAGTATGCTGCGT 
TTGTGCTGGGCGCTGCCTTTTCCAGCAACCCCAAGGTCCAGGTGGAGGCCATCGAAGGGGGA 
GCCCTGCAGAAGCTGCTGGTCATCCTGGCCACGGAGCAGCCGCTCACTGCAAAGAAGAAGGT 
CCTGTTTGCACTGTGCTCCCTGCTGCGCCACTTCCCCTATGCCCAGCGGCAGTTCCTGAAGC 
TCGGGGGGCTGCAGGTCCTGAGGACCCTGGTGCAGGAGAAGGGCACGGAGGTGCTCGCCGTG 
CGCGTGGTCACACTGCTCTACGACCTGGTCACGGAGAAGATGTTCGCCGAGGAGGAGGCTGA 
GCTGACCCAGGAGATGTCCCCAGAGAAGCTGCAGCAGTATCGCCAGGTACACCTCCTGCCAG 
GCCTGTGGGAACAGGGCTGGTGCGAGATCACGGCCCACCTCCTGGCGCTGCCCGAGCATGAT 
GCCCGTGAGAAGGTGCTGCAGACACTGGGCGTCCTCCTGACCACCTGCCGGGACCGCTACCG 
TCAGGACCCCCAGCTCGGCAGGACACTGGCCAGCCTGCAGGCTGAGTACCAGGTGCTGGCCA 
GCCTGGAGCTGCAGGATGGTGAGGACGAGGGCTACTTCCAGGAGCTGCTGGGCTCTGTCAAC 
AGCTTGCTGAAGGAGCTGAGATGAGGCCCCACACCAGGACTGGACTGGGATGCCGCTAGTGA 
GG CT GAGGGGT G CCAG CGTGGGTGGGCTT CTC AGG C AGGAGGAC AT CTTGGC AGTG CTGGCT 
TGGCCATTAAATGGAAACCTGAAGGCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 218 

MAPQSLPSSRMAPLGMLLGLLMAACFTFCLSHQNLKEFALTNPEKSSTKETERKETPCAEEEL 
DAEVLEWHPTHEWQALQPGQAVPAGSHVRLNLQTGEREAKXQYEDKFRNNLKGKRLDINTN 
TYTSQDLKSALAKFKEGAEMESSKEDKARQAEVKRLFRPIEEL^ 

RLINKFNSSSSSLEEKIAALFDLEYYVHQMDNAQDLLSFGGLQWINGLNSTEPLVKEYAAF 
VLGAAFSSNPKVQVEAIEGGALQKLLVILATEQPLTAKKKVLFALCSLLRHFPYAQRQFLKL 
GGLQVLRTLVQEKGTEVLAVRVVTLLYDLVTEKMFAEEEAELTQEMSPEKLQQYRQVHLLPG 
LWEQGWCE I TAHLLALPEHDAREKVLQTLGVLLTTCRDRYRQDPQLGRTIiASLQAE YQVLAS 
LELQDGEDEGYFQELLGSVNSLLKELR 
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FIGURE 219 

TTCGGCTTCCGTAGAGGAAGTGGCGCGGACCTTCATTTGGGGTTTCGGTTCCCCCCCTTCCC 

CTTCCCCGGGGTCTGGGGGTGACATTGCACCGCGCCCCTCGTGGGGTCGCGTTGCCACCCCA 

CGCGGACTCCCCAGCTGGCGCGCCCCTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCC 

TTCCCACCTGACCAGCCATGGGGGCTGCGGTGTTTTTCGGCTGCACTTTCGTCGCGTTCGGC 

CCGGCCTTCGCGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGT 

CGCAGGGGCATTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGG 

TCCATGTGACCGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCT 

GTCTCTGTCCTTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGA 

TGAAGGGTTAGCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCT 

ATGTTTCTGGTCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCT 

GATGCACTTGGGCCAGGTGTGGTTGGGATCCATGGAGACTCACCCTATTACTTCCTGACTTC 

AG CCTTT CTG ACAG CAGC CATT AT CCTG CTCCATACCTTTTGGGGAGTTGTGTT CTTTGATG 

CCTGTGAGAGGAGACGGTACTGGGCTTTGGGCCTGGTGGTTGGGAGTCACCTACTGACATCG 

GGACTGACATTCCTGAACCCCTGGTATGAGGCCAGCCTGCTGCCCATCTATGCAGTCACTGT 

TTCCATGGGGCTCTGGGCCTTCATCACAGCTGGAGGGTCCCTCCGAAGTATTCAGCGCAGCC 

TCTTGTGTAAGGACTGACTACCTGGACTGATCGCCTGACAGA7CCCACCTGCCTGTCCACTG 

CCCATGACTGAGCCCAGCCCCAGCCCGGGTCCATTGCCCACATTCTCTGTCTCCTTCTCGTC 

GGTCTACCCCACTACCTCCAGGGTTTTGCTTTGTCCTTTTGTGACCGTTAGTCTCTAAGCTT 

TACCAGGAGCAGCCTGGGTTCAGCCAGTCAGTGACTGGTGGGTTTGAATCTGCACTTATCCC 

CACCACCTGGGGACCCCCTTGTTGTGTCCAGGACTCCCCCTGTGTCAGTGCTCTGCTCTCAC 

CCTGCCCAAGACTCACCTCCCTTCCCCTCTGCAGGCCGACGGCAGGAGGACAGTCGGGTGAT 

GGTGTATTCTGCCCTGCGCATCCCACCCGAGGACTGAGGGAACCTAGGGGGGACCCCTGGGC 

CTGGGGTGCCCTCCTGATGTCCTCGCCCTGTATTTCTCCATCTCCAGTTCTGGACAGTGCAG 

GTTGCCAAGAAAAGGGACCTAGTTTAGCCATTGCCCTGGAGATGAAATTAATGGAGGCTCAA 

GGATAGATGAGCTCTGAGTTTCTCAGTACTCCCTCAAGACTGGACATCTTGGTCTTTTTCTC 

AGGCCTGAGGGGGAACCATTTTTGGTGTGATAAATACCCTAAACTGCCTTTTTTTCTTTTTT 

GAGGTGGGGGGAGGGAGGAGGTATATTGGAACTCTTCTAACCTCCTTGGGCTATATTTTCTC 

TCCTCGAGTTGCTCCTCATGGCTGGGCTCATTTCGGTCCCTTTCTCCTTGGTCCCAGACCTT 

GGGGGAAAGGAAGGAAGTGCATGTTTGGGAACTGGCATTACTGGAACTAATGGTTTTAACCT 

CCTTAACCACCAGCATCCCTCCTCTCCCCAAGGTGAAGTGGAGGGTGCTGTGGTGAGCTGGC 

CACTCCAGAGCTGCAGTGCCACTGGAGGAGTCAGACTACCATGACATCGTAGGGAAGGAGGG 

GAGATTTTTTTGTAGTTTTTAATTGGGGTGTGGGAGGGGCGGGGAGGTTTTCTATAAACTGT 

ATCATTTTCTGCTGAGGGTGGAGTGTCCCATCCTTTTAATCAAGGTGATTGTGATTTTGACT 

AATAAAAAAGAATTTGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 220 

MGAAVFFGCTFVAFGPAFALFLITVAGDPLRVIILVAGAFFWLVSLLIASVWFILVHVTDR 
SDARLQYGLL I FGAAVSVLLQE VFRF AYYKLLKKADEGLASLSEDGRSP I S I RQMAYVSGLS 
FGIISGVFSVINILADALGPGWGIHGDSPYYFLTSAFLTAAIILLHTFWGWFFDACERRR 
YWALGLWGSHLLTSGLTFLNPWYEASLLPI YAVTVSMGLWAF I TAGGSLRS IQRSLLCKD 
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FIGURE 221 

AAGCTGGTTTAAGGAAGCAGAGGAGGGTTAGATTCGTTGAGTGAGGACGGAAGATCAACCCA 
TTTCCATTCCGCCAGATGGCCTATGTTTCTGGTCTCTCCCTTCGGNATCATCAGTGGTGTNT 
TOTCTGTTATCAATATTTTGGCTGATGCANTTGGGCCAGGTGTGGTTGGGATCCATGGAGAC 
TCACCCTATTANTTCCTGANTTCAGCCTTTNTGACAGCAGCCATTATCCTGCTC 
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FIGURE 222 

GACCGACCGTTCAGATGCCCGGTTCCAGTACGGCTTCCTGATTTTTGGTGCTGCTGTNTCTG 
TCCTTCTACAGGAGGTGTTCCGCTTTGCCTANTACAAGCTGCTTAAGAAGGCAGATGAGGGG 
TTAGCATNGCTGAGTGAGGACGGAAGATCACCCATTTCCATCCGCCAGATGGCCTATGTTTN 
TGGTNTTTCCTTCGGTATCATCAGTGGTGTTTTNTCTGTTATCAATATTTTGGNTGATGCAN 
TTGGGCCAGGTGTGGTTGGGATCCATGGAGANTCACCCTATTAATTCCTGAATTCAGCCTTT 
NTGACAGCAGCCATTATCCTGNTCCATACCTTTTGGGGAGTTGTGTTTTTTGATGCCTGTGA 
GAGGAG 
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FIGURE 223 

NGTTGGAGAAGTGGCGCGGACNTTCATTTGGGGTTTCGGTTTCCCCCCTTTCCCTTTCCCCG 
GGGTCTGGGGTGACATTGCACGGGCCCCTCGTGGGGTCGCGTTGCCACCCCACGCGGACTCC 
CCAGNTGGNGCGCCCTTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCCTTCCCACNTG 
ACCAGCCATGGGGGCTGCGGTGTTTTTCGGCTGCACTTTCGTCGCGTTCGGCCCGGCCTTCG 
CGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGTCGCAGGGGCA 
TTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGGTCCATGTGAC 
CGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCTGTCTCTGTCC 
TTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGATGAGGGGTTA 
GCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCTATGTTTCTGG 
TCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCTGATGCACTTG 
GGCCAGGTGTGGTTGGGATCCATGGAGACTCACCC 
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FIGURE 224 

GTAAAAGAAAGTGGCCGGACCTTCATTGGGGTTTCGGTTCCCCCCTTTCCCNTTCCCCGGGG 
TCTGGGGGTGACATTGCACCGCGCCCNTCGTGGGGTCGCGTTGCCACCCCACGCGGACTCCC 
CAGNTGGCGCGCCCCTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCCTTCCCACCTGA 
CCAGCCATGGGGGCTGCGGTGTTTTTCGGGCTGCACTTTCGTCGCGTTCGGGCCCGGCCTTC 
GCGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGTCGCAGGGGC 
ATTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGGTCCATGTGA 
CCGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCTGTCTCTGTC 
CTTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGATGAGGGGTT 
AGCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCG CCAGATGG CCTATGTTTCTG 

GTCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCTGATGCACTT 
GGGCCAGGTGTGGTTGGGATCCATGGAGAC 



FTGIJRE 225 

GCCCCAGGGAGCAGTGGGTGGTTATAACTCAGGCCCGGTGCCCAGAGCCCAGGAGGAGGCAG 
TGGCCAGGAAGGCACAGGCCTGAGAAGTCTGCGGCTGAGCTGGGAGCAAATCCCCCACCCCC 
TACCTGGGGGACAGGGCAAGTGAGACCTGGTGAGGGTGGCTCAGCAGGCAGGGAAGGAGAGG 
TGTCTGTGCGTCCTGCACCCACATCTTTCTCTGTCCCCTCCTTGCCCTGTCTGGAGGCTGCT 
AGACTCCTATCTTCTGAATTCTATAGTGCCTGGGTCTCAGCGCAGTGCCGATGGTGGCCCGT 
CCTTGTGGTTCCTCTCTACCTGGGGAAATAAGGTGCAGCGGCCATOGCTACAGCAAGACCCC 
CCTGGATGTGGGTGCTCTGTGCTCTGATCACAGCCTTGCTTCTGGGGGTCACAGAGCATGTT 
CTCGCCAACAATGATGTTTCCTGTGACCACCCCTCTAACACCGTGCCCTCTGGGAGCAACCA 
GGACCTGGGAGCTGGGGCCGGGGAAGACGCCCGGTCGGATGACAGCAGCAGCCGCATCATCA 
ATGGATCCGACTGCGATATGCACACCCAGCCGTGGCAGGCCGCGCTGTTGCTAAGGCCCAAC 
CAGCTCTACTGCGGGGCGGTGTTGGTGCATCCACAGTGGCTGCTCACGGCCGCCCACTGCAG 
GAAGAAAGTTTTCAGAGTCCGTCTCGGCCACTACTCCCTGTCACCAGTTTATGAATCTGGGC 
AGCAGATGTTCCAGGGGGTCAAATCCATCCCCCACCCTGGCTACTCCCACCCTGGCCACTCT 
AACGACCTCATGCTCATCAAACTGAACAGAAGAATTCGTCCCACTAAAGATGTCAGACCCAT 
CAACGTCTCCTCTCATTGTCCCTCTGCTGGGACAAAGTGCTTGGTGTCTGGCTGGGGGACAA 
CCAAGAGCCCCCAAGTGCACTTCCCTAAGGTCCTCCAGTGCTTGAATATCAGCGTGCTAAGT 
CAGAAAAGGTGCGAGGATGCTTACCCGAGACAGATAGATGACACCATGTTCTGCGCCGGTGA 
CAAAGCAGGTAGAGACTCCTGCCAGGGTGATTCTGGGGGGCCTGTGGTCTGCAATGGCTCCC 
TGCAGGGACTCGTGTCCTGGGGAGATTACCCTTGTGCCCGGCCCAACAGACCGGGTGTCTAC 
ACGAACCTCTGCAAGTTCACCAAGTGGATCCAGGAAACCATCCAGGCCAACTCCTGAGTCAT 
CCCAGGACTCAGCACACCGGCATCCCCACCTGCTGCAGGGACAGCCCTGACACTCCTTTCAG 
ACCCTCATTCCTTCCCAGAGATGTTGAGAATGTTCATCTCTCCAGCCCCTGACCCCATGTCT 
CCTGGACTCAGGGTCTGCTTCCCCCACATTGGGCTGACCGTGTCTCTCTAGTTGAACCCTGG 
GAACAATTTCCAAAACTGTCCAGGGCGGGGGTTGCGTCTCAATCTCCCTGGGGCACTTTCAT 
C CT CAAGCTCAGGGC C CAT C C CTT CT CT G C AG CT CT GA C C C AAATT T AGT C C CAGAAAT AAA 
CTGAGAAGTGGAAAAAAAAA 
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FIGURE 226 

MATARP PWMWVL CAL I TALLLG VTEHVL ANND VS CDHP SNTVPS GSNQDLGAGAGE DARS DD 
SSSRIINGSDCDMHTQPWQAALLLRPNQLYCGAVLVHPQWLLTAAHCRKKVFRVRLGHYSLS 
PVYESGQQMFQGVKSIPHPGYSHPGHSNDLMLIKLNRRIRPTKDVRPINVSSHCPSAGTKCL 
VSGWGTTKSPQVHFPKVLQCLNISVLSQKRCEDAYPRQIDDTMFCAGDKAGRDSCQGDSGGP 
WCNGSLQGLVSWGDYPCARPNRPGVYTNLCKFTKWIQETIQANS 
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FIGURE 227 

ATGGTCAACGACCGGTGGAAGACCATGGGCGGCGCTGCCCAACTTGAGGACCGGCCGCGCGA 

^^ScCGCAG CGG C CG AG CTG CGG CT ACGTGCTGTG CACCGTGCTGCTGG CCCTGGCTGTGC 
TGCTGGCTGTAGCTGTCACCGGTGCCGTGCTCTTCCTGAACCACGCCCACGCGCC^GCACG 
GCGCCCCCACCTGTCGTCAGCACTGGGGCTGCCAGCGCCAACAGCGCCCTGGTCACTGTGGA 
AAGGGCC^ACAGCTCGCACCTCAGCATCCTCATTGACCCGCGCTGCCCCGACCT^CCGACA 
GCTTCGCACGCCTGGAGAGCGCCCAGGCCTCGGTGCTGCAGGCGCTGACAGAGCACCAGGCC 
CAGCCACGGCTGGTGGGCGACCAGGAGCAGGAGCTGCTGGACACGCTGGCCGACCAGCTGCC 
CCGGCTGCTGGCCCGAGCCTCAGAGCTGCAGACGGAGTGCATGGGGCTGCGGAAGGGGCATG 
GCACGCTGGGCCAGGGCCTCAGCGCCCTGCAGAGTGAGCAGGGCCGCCTCATCCAGCTTCTC 
TCTGAGAGCCAGGGCCACATGGCTCACCTGGTGAACTCCGTCAGCGACATCCTGGATGCCCT 
GCAGAGGGACCGGGGGCTGGGCCGGCCCCGCAACAAGGCCGACCTTCAGAGAGCGCCTGCCC 
GGGGAACCCGGCCCCGGGGCTGTGCCACTGGCTCCCGGCCCCGAGACTGTCTGGACGTCCTC 
CTAAGCGGACAGCAGGACGATGGCGTCTACTCTGTCTTTCCCACCCACTACCCGGCCGGCTT 
CCAGGTGTACTGTGACATGCGCACGGACGGCGGCGGCTGGACGGTGTTTCAGCGCCGGGAGG 
ACGGCTCCGTGAACTTCTTCCGGGGCTGGGACGCGTACCGAGACGGCTTTGGCAGGCTCACC 
GGGGAGCACTGGCTAGGGCTCAAGAGGATCCACGCCCTGACCACACAGGCTGCCTACGAGCT 
GCACGTGGACCTGGAGGACTTTGAGAATGGCACGGCCTATGCCCGCTACGGGAGCTTCGGCG 
TGGGCTTGTTCTCCGTGGACCCTGAGGAAGACGGGTACCCGCTCACCGTGGCTGACTATTCC 
GGCACTGCAGGCGACTCCCTCCTGAAGCACAGCGGCATGAGGTTCACCACCAAGGACCGTGA 
CAGCGACCATTCAGAGAACAACTGTGCCGCCTTCTACCGCGGTGCCTGGTGGTACCGCAACT 
GCCACACGTCCAACCTCAATGGGCAGTACCTGCGCGGTGCGCACGCCTCCTATGCCGACGGC 
GTGGAGTGGTCCTCCTGGACCGGCTGGCAGTACTCACTCAAGTTCTCTGAGATGAAGATCCG 
GCCGGTCCGGGAGGACCGCTAGACTGGTGCACCTTGTCCTTGGCCCTGCTGGTCCCTGTCGC 
CCCATCCCCGACCCCACCTCACTCTTTCGTGAATGTTCTCCACCCACCTGTGCCTGGCGGAC 
CCACTCTCCAGTAGGGAGGGGCCGGGCCATCCCTGACACGAAGCTCCCTGGGCCGGTGAAGT 
CACACATCGCCTTCTCGCCGTCCCCACCCCCTCCATTTGGCAGCTCACTGATCTCTTGCCTC 
TGCTGATGGGGGCTGGCAAACTTGACGACCCCAACTCCTGCCTGCCCCCACTGTGACTCCGG 
TGCTGTTTGCCGTCCCCTGGCCAGGATGGTGGAGTCTGCCCCAGGCACCCTCTGCCCTGCCC 
GGCCAAATACCCGGCATTATGGGGACAGAGAGCAGGGGGCAGACAGCACCCCTGGAGTCCTC 
CTAGCAGATCGTGGGGAATGTCAGGTCTCTCTGAGGTCAGGTCTGAGGCCAGTATCCTCCAG 
CCCTCCCAATGCCAACCCCCACCCCGTTTCCCTGGTGCCCAGAGAACCCACCTCTCCCCCAA 
GGGCCTCAGCCTGGCTGTGGGCTGGGTGGCCCCATCCTACCAGGCCCTGAGGTCAGGATGGG 
GAGCTGCTGCCTTTGGGGACCCACGCTCCAAGGCTGAGACCAGTTCCCTGGAGGCCACCCAC 
CCTGTGCCCCGGCAGGCCTGGGGTCTGCAGTCCTCTTACCTGCTGTGCCCACCTGCTCTCTG 
TCTCAAATGAGGCCCAACCCATCCCCCACCCAGCTCCCGGCCGTCCTCCTACCTGGGGCAGC 
CGGGGCTGCCATCCCATTTCTCCTGCCTCTGGAAGGTGGGTGGGGCCCTGCACCGTGGGGCT 
GGACTGCGCTAATGGGAAGCTCTTGGTTTTCTGGGCTGGGGCCTAGGCAGGGCTGGGATGAG 
GCTTGTACAACCCCCACCACCAATTTCCCAGGGACTCCAGGGTCCTGAGGCCTCCCAGGAGG 
GCCTTGGGGGTGATGACCCCTTCCCTGAGGTGGCTGTCTCCATGAGGAGGCCAACCCTTGCC 
ATTGACCGTGGCCACCTGGACCCAGGCCAGGCCCGGCCCGGCGAGTGGTCAAGGGACAGGGA 
CCACCTCACCGGGCAAATGGGGTCGGGGGGACTGGGGCACCAGACCAGGCACCACCTGGACA 
CTTTCTTGTTGAATCCTCCCAACACCCAGCACGCTGTCATCCCCACTCCTTGTGTGCACACA 
TGCAGAGGTGAGACCCGCAGGCTCCCAGGACCAGCAGCCACAAGGGCAGGGCTGGAGCCGGG 
TCCTCAGCTGTCTGCTCAGCAGCCCTGGACCCGCGTGCGTTACGTCAGGCCCAGATGCAGGG 
CGGCTTTTCCAAGGCCTCCTGATGGGGGCCTCCGAAAGGGCTGGAGTCAGCCTTGGGGAGCT 
GCCTAGCAGCCTCTCCTCGGGCAGGAGGGGAGGTGGCTTCCTCCAAAGGACACCCGATGGCA 
GGTGCCTAGGGGGTGTGGGGTTCCGTTCTCCCTTCCCCTCCCACTGAAGTTTGTGCTTAAAA 
AACAATAAATTTGACTTGGCACCACTGGGGGTTGGTGGGAGAGGCCGTGTGACCTGGCTCTC 
TGTCCCAGTGCCACCAGGTCATCCACATGCGCAG 
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FIGURE 228 

NT/NDRWKTMGGAAQLEDRPP.DKPQ 

APPPWSTGAASANSALVTVERADSSHLSILIDPRCPDLTDSFARLESAQASVLQALTEHQA 
QPRLVGDQEQELLDTLADQLPRLLARASELQTECMGLRKGHGTLGQGLSALQSEQGRLIQLL 
SESQGHMAHLVNSVSDILDALQRDRGLGRPRNKADLQRAPARGTRPRGCATGSRPRDCLDVL 
LSGQQDDGVYSVFPTHYPAGFQVYCDMRTDGGGWTVFQRREDGSVNFFRGWDAYRDGFGRLT 
GEHWLGLKRIHALTTQAAYELHVDLEDFENGTAYARYGSFGVGLFSVDPEEDGYPLTVADYS 
GTAGDSLLKHSGMRFTTKDRDSDHSENNCAAFYRGAWWYRNCHTSNLNGQYLRGAHASYADG 
VEWSSWTGWQYSLKFSEMKIRPVREDR 
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FIGURE 229 

GCAGTCAGAGACTTCCCCTGCCCCTCGCTGGGAAAGAACATTAGGAATGCCTTTTAGTGCCT 

TGCTTCCTGAACTAGCTCACAGTAGCCCGGCGGCCCAGGGCAATCCGACCACATTTCACTCT 

CACCGCTGTAGGAATCCAGATGCAGGCCAAGTACAGCAGCACGAGGGACATGCTGGATGATG 

ATGGGGACACCACCATGAGCCTGCATTCTCAAGCCTCTGCCACAACTCGGCATCCAGAGCCC 

CGGCGCACAGAGCACAGGGCTCCCTCTTCAACGTGGCGACCAGTGGCCCTGACCCTGCTGAC 

TTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTGTTTTTTCAGTACTACC 

AGCTCTCCAATACTGGTCAAGACACCATTTCTCAAATGGAAGAAAGATTAGGAAATACGTCC 

CAAGAGTTGCAATCTCTTCAAGTCCAGAATATAAAGCTTGCAGGAAGTCTGCAGCATGTGGC 

TGAAAAACTCTGTCGTGAGCTGTATAACAAAGCTGGAGCACACAGGTGCAGCCCTTGTACAG 

AACAATGGAAATGGCATGGAGACAATTGCTACCAGTTCTATAAAGACAGCAAAAGTTGGGAG 

GACTGTAAATATTTCTGCCTTAGTGAAAACTCTACCATGCTGAAGATAAACAAACAAGAAGA 

CCTGGAATTTGCCGCGTCTCAGAGCTACTCTGAGTTTTTCTACTCTTATTGGACAGGGCTTT 

TGCGCCCTGACAGTGGCAAGGCCTGGCTGTGGATGGATGGAACCCCTTTCACTTCTGAACTG 

TT CCATATTATAATAGATGTCACCAGCC CAAGAAG CAG AGACTG TGTGGCCATCCTCAATGG 

GATGATCTTCTCAAAGGACTGCAAAGAATTGAAGCGTTGTGTCTGTGAGAGAAGGGCAGGAA 

TGGTGAAGCCAGAGAGCCTCCATGTCCCCCCTGAAACATTAGGCGAAGGTGACTG^TTCGCC 

CTCTGCAACTACAAATAGCAGAGTGAGCCAGGCGGTGCCAAAGCAAGGGCTAGTTGAGACAT 

TGGGAAATGGAACATAATCAGGAAAGACTATCTCTCTGACTAGTACAAAATGGGTTCTCGTG 

TTTCCTGTTCAGGATCACCAGCATTTCTGAGCTTGGGTTTATGCACGTATTTAACAGTCACA 

AGAAGTCTTATTTACATGCCACCAACCAACCTCAGAAACCCATAATGTCATCTGCCTTCTTG 

GCTTAGAGATAACTTTTAGCTCTCTTTCTTCTCAATGTCTAATATCACCTCCCTGTTTTCAT 

GTCTTCCTTACACTTGGTGGAATAAGAAACTTTTTGAAGTAGAGGAAATACATTGAGGTAAC 

ATCCTTTTCTCTGACAGTCAAGTAGTCCATCAGAAATTGGCAGTCACTTCCCAGATTGTACC 

AGCAAATACACAAGGAATTCTTTTTGTTTGTTTCAGTTCATACTAGTCCCTTCCCAATCCAT 

CAGTAAAGACCCCATCTGCCTTGTCCATGCCGTTTCCCAACAGGGATGTCACTTGATATGAG 

AATCTCAAATCTCAATGCCTTATAAGCATTCCTTCCTGTGTCCATTAAGACTCTGATAATTG 

TCTCCCCTCCATAGGAATTTCTCCCAGGAAAGAAATATATCCCCATCTCCGTTTCATATCAG 

AACTACCGTCCCCGATATTCCCTTCAGAGAGATTAAAGACCAGAAAAAAGTGAGCCTCTTCA 

TCTGCACCTGTAATAGTTTCAGTTCCTATTTTCTTCCATTGACCCATATTTATACCTTTCAG 

GTACTGAAGATTTAATAATAATAAATGTAAATACTGTGAAAAA 
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FIGU R E 230 

MQAKYSSTRDMLDDDGDTTMSLHSQASATTRHPEPRRTEHRAPSSTWRPVALTLLTLCLVLL 
IGLAALGLLFFQYYQLSNTGQDTISQMEERLGNTSQELQSLQVQNIKLAGSLOHVAEKLCRE 
LYNKAGAHRCSPCTEQWKWHGDNCYQFYKDSKSWEDCKYFCLSENSTMLKINKQEDLEFAAS 
QSYSEFFYSYWTGLLRPDSGKAWLWMDGTPFTSELFHIIIDVTSPRSRDCVAILNGMIFSKD 
CKELKRCVCERRAGMVKPESLHVPPETLGEGD 
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FIGURE 231 

AATTTTCACCGCTGTAGGAATCCAGATGCAGGCCAAGTACAGCAGCACGAGGGACATGNTGG 
ATGATGATGGGACACCACCATGAGCCTGCATTNTCAAGCTTTTGCCACAATTCGGCATCCAG 
AGCCCCGGCGCACAGAGCACAGGGNTCCTTTTTCAACGTGGCGACCAGTGGCCCTGACCCTG 
CTGACTTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTGTTTTTTCAGTA 
CTACCAGCTCTCCAATACTGGTCAAGACACCATTTCTCAAATGGAAGAAAGATTAGGAAATA 
CGTCCCAAGAGTTGCAATTTNTTCAAGTCCAGAATATAAAGCTTGCAGGAAGTNTGCAGCAT 
GTGGCTGAAAAACTCTGTCGTGAGCTGTATAACAAAGCTGGAGGAACTTTGAAGGAGGGCAA 
AGTNTCCTCATNTACTATACACACACCACTTCCC 
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FIGURE 232 

GCCGAGCGCAAGAACCCTGCGCAGCCCAGAGCAGCTGCTGGAGGGGAATCGAGGCGCGGCTC 
CGGGGATTCGGCTCGGGCCGCTGGCTCTGCTCTGCGGGGAGGGAGCGGGCCCGCCCGCGGGG 
CCCGAGCCCTCCGGATCCGCCCCCTCCCCGGTCCCGCCCCCTCGGAGACTCCTCTGGCTGCT 
CTGGGGGTTCGCCGGGGCCGGGGACCCGCGGTCCGGGCGCCATSCGGGCATCGCTGCTGCTG 
TCGGTGCTGCGGCCCGCAGGGCCCGTGGCCGTGGGCATCTCCCTGGGCTTCACCCTGAGCCT 
GCTCAGCGTCACCTGGGTGGAGGAGCCGTGCGGCCCAGGCCCGCCCCAACCTGGAGACTCTG 
AGCTGCCGCCGCGCGGCAACACCAACGCGGCGCGCCGGCCCAACTCGGTGCAGCCCGGAGCG 
GAGCGCGAGAAGCCCGGGGCCGGCGAAGGCGCCGGGGAGAATTGGGAGCCGCGCGTCTTGCC 
CTACCACCCTGCACAGCCCGGCCAGGCCGCCAAAAAGGCCGTCAGGACCCGCTACATCAGCA 
CGGAGCTGGGCATCAGGCAGAGGCTGCTGGTGGCGGTGCTGACCTCTCAGACCACGCTGCCC 
ACGCTGGGCGTGGCCGTGAACCGCACGCTGGGGCACCGGCTGGAGCGTGTGGTGTTCCTGAC 
GGGCGCACGGGGCCGCCGGGCCCCACCTGGCATGGCAGTGGTGACGCTGGGCGAGGAGCGAC 
CCATTGGACACCTGCACCTGGCGCTGCGCCACCTGCTGGAGCAGCACGGCGACGACTTTGAC 
TGGTTCTTCCTGGTGCCTGACACCACCTACACCGAGGCGCACGGCCTGGCACGCCTAACTGG 
CCACCTCAGCCTGGCCTCCGCCGCCCACCTGTACCTGGGCCGGCCCCAGGACTTCATCGGCG 
GAGAGCCCACCCCCGGCCGCTACTGCCACGGAGGCTTTGGGGTGCTGCTGTCGCGCATGCTG 
CTGCAACAACTGCGCCCCGACCTGGAAGGCTGCCGCAACGACATCGTCAGTGCGCGCCCTGA 
CGAGTGGCTGGGTCGCTGCATTCTCGATGCCACCGGGGTGGGCTGCACTGGTGACCACGAGG 
GGGTGCACTATAGCCATCTGGAGCTGAGCCCTGGGGAGCCAGTGCAGGAGGGGGACCCTCAT 
TTCCGAAGTGCCCTGACAGCCCACCCTGTGCGTGACCCTGTGCACATGTACCAGCTGCACAA 
AGCTTTCGCCCGAGCTGAACTGGAACGCACGTACCAGGAGATCCAGGAGTTACAGTGGGAGA 
TCCAGAATACCAGCCATCTGGCCGTTGATGGGGACCGGGCAGCTGCTTGGCCCGTGGGTATT 
CCAGCACCATCCCGCCCGGCCTCCCGCTTTGAGGTGCTGCGCTGGGACTACTTCACGGAGCA 
GCACGCTTTCTCCTGCGCCGATGGCTCACCCCGCTGCCCACTGCGTGGGGCTGACCGGGCTG 
ATGTGGCCGATGTTCTGGGGACAGCTCTAGAGGAGCTGAACCGCCGCTACCACCCGGCCTTG 
CGGCTCCAGAAGCAGCAGCTGGTGAATGGCTACCGACGCTTTGATCCGGCCCGGGGTATGGA 
ATACACGCTGGACTTGCAGCTGGAGGCACTGACCCCCCAGGGAGGCCGCCGGCCCCTCACTC 
GCCGAGTGCAGCTGCTCCGGCCGCTGAGCCGCGTGGAGATCTTGCCTGTGCCCTATGTCACT 
GAGGCCTCACGTCTCACTGTGCTGCTGCCTCTAGCTGCGGCTGAGCGTGACCTGGCCCCTGG 
CTTCTTGGAGGCCTTTGCCACTGCAGCACTGGAGCCTGGTGATGCTGCGGCAGCCCTGACCC 
TGCTGCTACTGTATGAGCCGCGCCAGGCCCAGCGCGTGGCCCATGCAGATGTCTTCGCACCT 
GTCAAGGCCCACGTGGCAGAGCTGGAGCGGCGTTTCCCCGGTGCCCGGGTGCCATGGCTCAG 
TGTGCAGACAGCCGCACCCTCACCACTGCGCCTCATGGATCTACTCTCCAAGAAGCACCCGC 
TGGACACACTGTTCCTGCTGGCCGGGCCAGACACGGTGCTCACGCCTGACTTCCTGAACCGC 
TGCCGCATGCATGCCATCTCCGGCTGGCAGGCCTTCTTTCCCATGCATTTCCAAGCCTTCCA 
CCCAGGTGTGGCCCCACCACAAGGGCCTGGGCCCCCAGAGCTGGGCCGTGACACTGGCCGCT 
TTGATCGCCAGGCAGCCAGCGAGGCCTGCTTCTACAACTCCGACTACGTGGCAGCCCGTGGG 
CGCCTGGCGGCAGCCTCAGAACAAGAAGAGGAGCTGCTGGAGAGCCTGGATGTGTACGAGCT 
GTTCCTCCACTTCTCCAGTCTGCATGTGCTGCGGGCGGTGGAGCCGGCGCTGCTGCAGCGCT 
ACCGGGCCCAGACGTGCAGCGCGAGGCTCAGTGAGGACCTGTACCACCGCTGCCTCCAGAGC 
GTGCTTGAGGGCCTCGGCTCCCGAACCCAGCTGGCCATGCTACTCTTTGAACAGGAGCAGGG 
CAACAGCACCTGACCCCACCCTGTCCCCGTGGGCCGTGGCATGGCCACACCCCACCCCACTT 
CTCCCCCAAAACCAGAGCGACCTGCCAGCCTCGCTGGGCAGGGCTGGCCGTAGCCAGACCCC 
AAGCTGGCCCACTGGTCCCCTCTCTGGCTCTGTGGGTCCCTGGGCTCTGGACAAGCACTGGG 
GGACGTGCCCCCAGAGCCACCCACTTCTCATCCCAAACCCAGTTTCCCTGCCCCCTGACGCT 
GCTGATTCGGGCTGTGGCCTCCACGTATTTATGCAGTACAGTCTGCCTGACGCCAGCCCTGC 
CTCTGGGCCCTGGGGGCTGGGCTGTAGAAGAGTTGTTGGGGAAGGAGGGAGCTGAGGAGGGG 
GCATCTCCCAACTTCTCCCTTTTGGACCCTGCCGAAGCTCCCTGCCTTTAATAAAC7GGCCA 
AGTGTGGAAAAA 
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FIGURE 233 

MRASLLLSVLRPAGPVAVGISLGFTLSLLSVTWVEEPCGPGPPQPGDSELPPRGNTNAARRP 
NSVQPGAEREKPGAGEGAGENWEPRVLP YHPAQPGQAAKKAVRTRY I STELG IRQRLLVAVL 
TSQTTLPTLGVAVNRTLGHRLERWFLTGARGRRAPPGMAWTLGEERPIGHUILALRHLLE 
QHGDDFDWFFLVPDTTYTEAHGLARLTGHLSLAS AAHLYLGRPQDF I GGEPTPGRYCHGGFG 
VLLSRKLLQQLRPHLEGCRNDIVSARPDEWLGRCILDATGVGCTGDHEGVHYSHLELSPGEP 
VQEGDPHFRSALTAHPWDPTOMYQLHKAFARAELERTYQEIQELQWEIQNTSHLAVDGDRA 
AAWPVGIPAPSRPASRFEVLRWDYFTEQHAFSCADGSPRCPLRGADRADVADVLGTALEELN 
RRYHPALRLQKQQLVNGYRRFDPARGMEYTLDLQLEALTPQGGRRPLTRRVQLLRPLSRVEI 
LPVPYVTEASRLTVLLPIJU^RDLAPGFLEAFATAALEPGDAAAALTLLLLYEPRQAQRVA 
HADVTAPVKAHVAELERRFPGARVPWLSVQTAAPSPLRLMDLLSKKHPLDTLFLLAGPDTVL 
TPDFLNRCRMHAISGWQAFFPMHFQAFHPGVAPPQGPGPPELGRDTGRFDRQAASEACFYNS 
DYVAARGRLAAASEQEEELLESLDVYELFLHFSSLHVLRAVEPALLQRYRAQTCSARLSEDL 
YHRCLQSVLEGLGSRTQLAMLLFEQEQGNST 
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GCTCTGGCCGGCCCCGGCGATTGGTCACCGCCCGCTAGGGGACAGCCCTGGCCTCCTCTGAT 

TGGCAAGCGCTGGCCACCTCCCCACACCCCTTGCGAACGCTCCCCTAGTGGAGAAAAGGAGT 

AGCTATTAGCCAATTCGGCAGGGCCCGCTTTTTAGAAGCTTGATTTCCTTTGAAGATGAAAG 

ACTAGCGGAAGCTCTGCCTCTTTCCCCAGTGGGCGAGGGAACTCGGGGCGATTGGCTGGGAA 

CTGTATCCACCCAAATGTCACCGATTTCTTCCTATGCAGGAAATGAGCAGACCCATCAATAA 

GAAATTTCTCAGCCTGGCCGAAAATGGTTGGCCCCACGAAGCCACGACAACTGGAGGCAAAG 

AGGGTTGCTCAACGCCCCGCCTCATTGGAAAACCAAATCAGATCTGGGACCTATATAGCGTG 

GCGGAGGCGGGGCGATGATTGTCGCGCTCGCACCCACTGCAGCTGCGCACAGTCGCATTTCT 

TTCCCCGCCCCTGAGACCCTGCAGCACCATCTGTC^TGGCGGCTGGGCTGTTTGGTTTGAGC 

GCTCGCCGTCTTTTGGCGGCAGCGGCGACGCGAGGGCTCCCGGCCGCCCGCGTCCGCTGGGA 

ATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCTGTGGCGGGAAAGCGGCCCCCAGAAC 

CGACCACACCGTGGCAAGAGGACCCAGAACCCGAGGACGAAAACTTGTATGAGAAGAACCCA 

GACTCCCATGGTTATGACAAGGACCCCGTTTTGGACGTCTGGAACATGCGACTTGTCTTCTT 

CTTTGGCGTCTCCATCATCCTGGTCCTTGGCAGCACCTTTGTGGCCTATCTGCCTGACTACA 

GGATGAAAGAGTGGTCCCGCCGCGAAGCTGAGAGGCTTGTGAAATACCGAGAGGCCAATGGC 

CTTCCCATCATGGAATCCAACTGCTTCGACCCCAGCAAGATCCAGCTGCCAGAGGATGAGTG 

ACCAGTTGCTAAGTGGGGCTCAAGAAGCACCGCCTTCCCCACCCCCTGCCTGCCATTCTGAC 

CTCTTCTCAGAGCACCTAATTAAAGGGGCTGAAAGTCTGAA 
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FIGURE 235 

MAAGLFGL.SARRLLAAAATRGLPAARVRWESSFSRTWAPSAVAGKRPPEPTTPWQEDPEPE 
DENLYEKNPDSHGYDKDPVLDVWNMRLVFFFGVS 1 1 LVLGSTFVAYLPDYRMKEWSRREAER 
LVKYREANGLPIMESNCFDPSKIQLPEDE 
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GGCGGCTGGGCTGTTTGGTTTGAGCGCTCGCCGTCTTTTGGCGGCAGCGGCGACGCGAGGGC 
TCCCGGCCGCCCGCGTCCGCTGGGAATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCT 
GTGGCGGGAAAGCGGCCCCCAGAACCGACCACACCGTGGCAAGAGGACCCAGAACCCGAGGA 
CGAAAACTTGTATGAGAAGAACCCAGACTCCCATGGTTATGACAAGGACCCCGTTTTGGACG 
TCTGGAACATGCGACTTGTCTTCTTCTTTGGCGTCTCCATCATCCTGGTCCTTGGCAGCACC 
TTTGTGGCCTATCTGCCTGACTACAGGATGAAAGAGTGGTCCCGCCGCGAAGCTGAGAGGCT 
TGTGAAATACCGAGAGGCCAATGGCCTTCCCATCATGGAATCCAACTGCTTCGACCCCAGCA 
AGATCCAG 
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FIGURE 237 

GCGGCGGCTATGCCGCTTGCTCTGCTCGTCCTGTTGCTCCTGGGGCCCGGCGGCTGGTGCCT 
TGCAGAACCCCCACGCGACAGCCTGCGGGAGGAACTTGTCATCACCCCGCTGCCTTCCGGGG 
ACGTAGCCGCCACATTCCAGTTCCGCACGCGCTGGGATTCGGAGCTTCAGCGGGAAGGAGTG 
TCCCATTACAGGCTCTTTCCCAAAGCCCTGGGGCAGCTGATCTCCAAGTATTCTCTACGGGA 
GCTGCACCTGTCATTCACACAAGGCTTTTGGAGGACCCGATACTGGGGGCCACCCTTCCTGC 
AGGCCCCATCAGGTGCAGAGCTGTGGGTCTGGTTCCAAGACACTGTCACTGATGTGGATAAA 
TCTTTGGAAGGAGCTCAGTAATGTCCTCTCAGGGATCTTCTGCGCCTCTCTCAACTTCATCGA 
CTCCACCAACACAGTCACTCCCACTGCCTCCTTCAAACCCCTGGGTCTGGCCAATGACACTG 
ACCACTACTTTCTGCGCTATGCTGTGCTGCCGCGGGAGGTGGTCTGCACCGAAAACCTCACC 
CCCTGGAAGAAGCTCTTGCCCTGTAGTTCCAAGGCAGGCCTCTCTGTGCTGCTGAAGGCAGA 
TCGCTTGTTCCACACCAGCTACCACTCCCAGGCAGTGCATATCCGCCCTGTTTGCAGAAATG 
CACGCTGTACTAGCATCTCCTGGGAGCTGAGGCAGACCCTGTCAGTTGTATTTGATGCCTTC 
ATCACGGGGCAGGGAAAGAAAGACTGGTCCCTCTTCCGGATGTTCTCCCGAACCCTCACGGA 
GCCCTGCCCCCTGGCTTCAGAGAGCCGAGTCTATGTGGACATCACCACCTACAACCAGGACA 
ACGAGACATTAGAGGTGCACCCACCCCCGACCACTACATATCAGGACGTCATCCTAGGCACT 
CGGAAGACCTATGCCATCTATGACTTGCTTGACACCGCCATGATCAACAACTCTCGAAACCT 
CAACATCCAGCTCAAGTGGAAGAGACCCCCAGAGAATGAGGCCCCCCCAGTGCCCTTCCTGC 
ATGCCCAGCGGTACGTGAGTGGCTATGGGCTGCAGAAGGGGGAGCTGAGCACACTGCTGTAC 
AACACCCACCCATACCGGGCCTTCCCGGTGCTGCTGCTGGACACCGTACCCTGGTATCTGCG 
GCTGTATGTGCACACCCTCACCATCACCTCCAAGGGCAAGGAGAACAAACCAAGTTACATCC 
ACTACCAGCCTGCCCAGGACCGGCTGCAACCCCACCTCCTGGAGATGCTGATTCAGCTGCCG 
GCCAACTCAGTCACCAAGGTTTCCATCCAGTTTGAGCGGGCGCTGCTGAAGTGGACCGAGTA 
CACGCCAGATCCTAACCATGGCTTCTATGTCAGCCCATCTGTCCTCAGCGCCCTTGTGCCCA 
GCATGGTAGCAGCCAAGCCAGTGGACTGGGAAGAGAGTCCCCTCTTCAACAGCCTGTTCCCA 
GTCTCTGATGGCTCTAACTACTTTGTGCGGCTCTACACGGAGCCGCTGCTGGTGAACCTGCC 
GACACCGGACTTCAGCATGCCCTACAACGTGATCTGCCTCACGTGCACTGTGGTGGCCGTGT 
GCTACGGCTCCTTCTACAATCTCCTCACCCGAACCTTCCACATCGAGGAGCCCCGCACAGGT 
GGCCTGGCCAAGCGGCTGGCCAACCTTATCCGGCGCGCCCGAGGTGTCCCCCCACTC2G&TT 
CTTGCCCTTTCCAGCAGCTGCAGCTGCCGTTTCTCTCTGGGGAGGGGAGCCCAAGGGCTGTT 
TCTGCCACTTGCTCTCCTCAGAGTTGGCTTTTGAACCAAAGTGCCCTGGACCAGGTCAGGGC 
CTACAGCTGTGTTGTCCAGTACAGGAGCCACGAGCCAAATGTGGCATTTGAATTTGAATTAA 
CTTAGAAATTCATTTCCTCACCTGTAGTGGCCACCTCTATATTGAGGTGCTCAATAAGCAAA 
AGTGGTCGGTGGCTGCTGTATTGGACAGCACAGAAAAAGATTTCCATCACCACAGAAAGGTC 
GGCTGGCAGCACTGGCCAAGGTGATGGGGTGTGCTACACAGTGTATGTCACTGTGTAGTGGA 
TGGAGTTTACTGTTTGTGGAATAAAAACGGCTGTTTCCGTGGAAAAAAAAAAAA 
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FIGURE 238 

MPLALLVLLLLGPGGWCLAEPPRDSLREELVITPLPSGDVAATFQFRTRWDSELQREGVSHY 
RLFPKALGQLISKYSLRELHLSFTQGFWRTRYWGPPFLQAPSGAELWVWFQDTVTDVDKSWK 
ELSNVLSGIFCASLNFIDSTNTVTPTASFKPLGLANDTDHYFLRYAVLPREVVCTENLTPWK 
KLLPCS S KAGLS VLLKADRLFHTS YHS QAVK I RP VCRNARCTS I SWELRQTLS WFDAF ITG 
QGKKDWSLFRMFSRTLTEPCPLASESRVYVDITTYNQDNETLEVHPPPTTTYQDVILGTRKT 
YAIYDLLDTAMINNSRNLNIQLKWKRPPENEAPPVPFLHAQRYVSGYGLQKGELSTLLYNTH 
PYRAFPVLLLDTVPWYLRLYVHTLTITSKGKENKPSYIHYQPAQDRLQPHLLEMLIQLPANS 
VTKVSIQFERALLKWTEYTPDPNHGFWSPSVLSALVPSMVAAKPVDWEESPLFNSLFPVSD 
GSNYFVRLYTEPLLVNLPTPDFSMPYNVICLTCTWAVCYGSFYNLLTRTFHIEEPRTGGLA 
KRLANLIRRARGVPPL 
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FIGURE 239 

CAACATGGGGTCCAGCAGCTTCTTGGTCCTCATGGTGTCTCTCGTTCTTGTGACCCTGGTGG 
CTGTGGAAGGAGTTAAAGAGGGTATAGAGAAAGCAGGGGTTTGCCCAGCTGACAACGTACGC 
TGCTTCAAGTCCGATCCTCCCCAGTGTCACACAGACCAGGACTGTCTGGGGGAAAGGAAGTG 
TTGTTACCTGCACTGTGGCTTCAAGTGTGTGATTCCTGTGAAGGAACTGGAAGAAGGAGGAA 
ACAAGGATGAAGATGTGTCAAGGCCATACCCTGAGCCAGGATGGGAGGCCAAGTGTCCAGGC 
TCCTCCTCTACCAGGTGTCCTCAGAAATGATGCTGGGTCCTTTCTACCTCTGGGGGTCACTC 
TCACTTGGCACCTGCCCCTGAGGGTCCTGAGACTTGGAATATGGAAGAAGCAATACCCAACC 
CCACCAAAGAAAAC CTGAG CTTGAAGT C CTTT TCCCCAAAAAG AGGGAAG AGTCACAAAAAG 
TCCAGACCCCAGGGACGGTACTTTCCCTCTCTACCTGGTGCTCCTCCCTAATGCTCATGAAT 
GGACCCCTCATGAATGAAACCAGTGCCCTTATAAGAGACCCCAAAGAGCTGCCTTGCCCTTC 
TGCAATGTGTGATCACAGCTAGAAGGCACTGTCAGAGAAGAGAAACTGGTCCTCACCAGATG 
CTGAATCTGCTGGTGCCTTGATCTTGGACTTCCCAGCCTCTAGAACTGTAAGAAATAAATAT 
TTGCTGTTTATAATCCAA 
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FIGURE 240 



MGSSSFLVLMVSLVLVTLVAVSGVXEGIEKAGVCPADNVRCFKSDPPQCHTDQDCLGERKCC 
YLHCGFKCVIPVKELEEGGNKDEDVSRPYPEPGWEAKCPGSSSTRCPQK 



mrmrns 
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FIGURE 241 

AAACTCAGCACTTGCCGGAGTGGCTCATTGTTAAGACAAAGGGTGTGCACTTCCTGGCCAGG 
AAACCTGAGCGGTGAGACTCCCAGCTGCCTACATCAAGGCCCCAGGACATGCAGAACCTTCC 
TCTAGAACCCGACCCACCACCATGAGGTCCTGCCTGTGGAGATGCAGGCACCTGAGCCAAGG 
CGTCCAGTGGTCCTTGCTTCTGGCTGTCCTGGTCTTCTTTCTCTTCGCCTTGCCCTCTTTTA 
TTAAGGAGCCTCAAACAAAGCCTTCCAGGCATCAACGCACAGAGAACATTAAAGAAAGGTCT 
CTACAGTCCCTGGCAAAGCCTAAGTCCCAGGCACCCACAAGGGCGAGGAGGACAACCATCTA 
TGCAGAGCCAGCGCCAGAGAACAATGCCCTCAACACACAAACCCAGCCCAAGGCCCACACCA 
CCGGAGACAGAGGAAAGGAGGCCAACCAGGCACCGCCGGAGGAGCAGGACAAGGTGCCCCAC 
ACAGCACAGAGGGCAGCATGGAAGAGCCCAGAAAAAGAGAAAACCATGGTGAACACACTGTC 
ACCCAGAGGGCAAGATGCAGGGATGGCCTCTGGCAGGACAGAGGCACAATCATGGAAGAGCC 
AGGACACAAAGACGACCCAAGGAAATGGGGGCCAGACCAGGAAGCTGACGGCCTCCAGGACG 
GTGTCAGAGAAGCACCAGGGCAAAGCGGCAACCACAGCCAAGACGCTCATTCCCAAAAGTCA 
GCAC AGAATG CTGG CT CC CACAGG AG CAGTGTCAACAAGGACGAGACAGAAAGG AGTG AC CA 
CAGCAGTCATCCCACCTAAGGAGAAGAAACCTCAGGCCACCCCACCCCCTGCCCCTTTCCAG 
AGCCCCACGACGCAGAGAAACCAAAGACTGAAGGCCGCCAACTTCAAATCTGAGCCTCGGTG 
GGATTTTGAGGAAAAATACAGCTTCGAAATAGGAGGCCTTCAGACGACTTGCCCTGACTCTG 
TGAAGATCAAAGCCTCCAAGTCGCTGTGGCTCCAGAAACTCTTTCTGCCCAACCTCACTCTC 
TTCCTGGACTCCAGACACTTCAACCAGAGTGAGTGGGACCGCCTGGAACACTTTGCACCACC 
CTTTGGCTTCATGGAGCTCAACTACTCCTTGGTGCAGAAGGTCGTGACACGCTTCCCTCCAG 
TGCCCCAGCAGCAGCTGCTCCTGGCCAGCCTCCCCGCTGGGAGCCTCCGGTGCATCACCTGT 
GCCGTGGTGGGCAACGGGGGCATCCTGAACAACTCCCACATGGGCCAGGAGATAGACAGTCA 
CGACTACGTGTTCCGATTGAGCGGAGCTCTCATTAAAGGCTACGAACAGGATGTGGGGACTC 
GGACATCCTTCTACGGCTTTACCGCCTTCTCCCTGACCCAGTCACTCCTTATATTGGGCAAT 
CGGGGTTTCAAGAACGTGCCTCTTGGGAAGGACGTCCGCTACTTGCACTTCCTGGAAGGCAC 
CCGGGACTATGAGTGGCTGGAAGCACTGCTTATGAATCAGACGGTGATGTCAAAAAACCTTT 
TCTGGTTCAGGCACAGACCCCAGGAAGCTTTTCGGGAAGCCCTGCACATGGACAGGTACCTG 
TTGCTGCACCCAGACTTTCTCCGATACATGAAGAACAGGTTTCTGAGGTCTAAGACCCTGGA 
TGGTGCCCACTGGAGGATATACCGCCCCACCACTGGGGCCCTCCTGCTGCTCACTGCCCTTC 
AGCTCTGTGACCAGGTGAGTGCTTATGGCTTCATCACTGAGGGCCATGAGCGCTTTTCTGAT 
CACTACTATGATACATCATGGAAGCGGCTGATCTTTTACATAAACCATGACTTCAAGCTGGA 
GAGAGAAGTCTGGAAGCGGCTACACGATGAAGGGATAATCCGGCTGTACCAGCGTCCTGGTC 
CCGGAACTGCCAAAGCCAAGAACTGACCGGGGCCAGGGCTGCCATGGTCTCCTTGCCTGCTC 
CAAGGCACAGGATACAGTGGGAATCTTGAGACTCTTTGGCCATTTCCCATGGCTCAGACTAA 
GCTCCAAGCCCTTCAGGAGTTCCAAGGGAACACTTGAACCATGGACAAGACTCTCTCAAGAT 
GGCAAATGGCTAATTGAGGTTCTGAAGTTCTTCAGTACATTGCTGTAGGTCCTGAGGCCAGG 
GATTTTTAATTAAATGGGGTGATGGGTGGCCAATACCACAATTCCTGCTGAAAAACACTCTT 
CCAGTCC7VAAAGCTTCTTGATACAGAAAAAAGAGCCTGGATTTACAGAAACATATAGATCTG 
GTTTGAATTCCAGATCGAGTTTACAGTTGTGAAATCTTGAAGGTATTACTTAACTTCACTAC 
AGATTGTCTAGAAGACCTTTCTAGGAGTTATCTGATTCTAGAAGGGTCTATACTTGTCCTTG 
TCTTTAAGCTATTTGACAACTCTACGTGTTGTAGAAAACTGATAATAATACAAATGATTGTT 
GTCCATGGAAAGGCAAATAAATTTTCTACAGTGAAAAAAAAAAAAAAA 
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FIGURE 242 

MRS CLWRCRHLSQGVQWSLLLAVLVFFLFALP S F I KEPQTKPSRHQRTEN I KERSLQSLAKP 

KSQAPTRARRTTIYAEPAPENNALNTQTQPKAHTTGDRGKEANQAPPEEQDKVPHTAQRAAW 

KSPEKEKTMVNTLSPRGQDAGMASGRTEAQSWKSQDTKTTQGNGGQTRKLTASRTVSEKHQG 

KAATTAKTLIPKSQHRMLAPTGAVSTRTRQKGVTTAVIPPKEKKPQATPPPAPFQSPTTQRN 

QRLKAANFKSEPRWDFEEKYSFEIGGLQTTCPDSVKIKASKSLWLQKLFLPNLTLFLDSRHF 

NQSEWDRLEHFAPPFGFMELNYSLVQKVWRFPPVPQQQLLLASLPAGSLRCITCAVVGNGG 

ILNNSHMGQEIDSHDYVFRLSGALIKGYEQDVGTRTSFYGFTAFSLTQSLLILGNRGFKNVP 

LGKDTOYLHFLEGTRDYEWLEALLMNQTVMSKNLFWFRHRPQEAFREALHMDRYLLLHPD^ 

RYMKNRFLRSKTLDGAHWRIYRPTTGALLLLTALQLCDQVSAYGFITEGHERFSDHYYDTSW 

KRL I FY INHDFKLEREWKRLHDEG 1 1 RLYQRPGPGTAKAKN 



FIGURE 243 

CGATGCGCGGACCCGGGCACCCCCTCCTCCTGGGGCTGCTGCTGGTGCTGGGGCCTTCGCCG 
GAGCAGCGAGTGGAAATTGTTCCTCGAGATCTGAGGATGAAGGACAAGTTTCTAAAACACCT 
TACAGGCCCTCTTTATTTTAGTCCAAAGTGCAGCAAACACTTCCATAGACTTTATCACAACA 
CCAGAGACTGCACCATTCCTGCATACTATAAAAGATGCGCCAGGCTTCTTACCCGGCTGGCT 
GTCAGTCCAGTGTGCATGGAGGATAAGTGAGCAGACCGTACAGGAGCAGCACACCAGGAGCC 
ATGAGAAGTGCCTTGGAAACCAACAGGGAAACAGAACTATCTTTATACACATCCCCTCATGG 
ACAAGAGATTTATTTTTGCAGACAGACTCTTCCATAAGTCCTTTGAGTTTTGTATGTTGTTG 
ACAGTTTGCAGATATATATTCGATAAATCAGTGTACTTGACAGTGTTATCTGTCACTTATTT 



FIGURE 244 

MRGPGHPLLLGLLLVLGPSPEQRVEIVPRDLRMKDKFLKHLTGPLYFSPKCSKHFHRLYHNT 
RDCTI PAYYKR CARLLTRLAVS P VCMEDK 



FIGURE 245 

GGGCTGGGCCCCGCCGCAGCTCCAGCTGGCCGGCTTGGTCCTGCGGTCCCTTCTCTGGGAGG 
CCCGACCCCGGCCGCGCCCAGCCCCCACCATGCCACCCGCGGGGCTCCGCCGGGCCGCGCCG 
CTCACCGCAATCGCTCTGTTGGTG'CTGGGGGCTCCCCTGGTGCTGGCCGGCGAGGACTGCCT 
GTGGTACCTGGACCGGAATGGCTCCTGGCATCCGGGGTTTAACTGCGAGTTCTTCACCTTCT 
GCTGCGGGACCTGCTACCATCGGTACTGCTGCAGGGACCTGACCTTGCTTATCACCGAGAGG 
CAGCAGAAGCACTGCCTGGCCTTCAGCCCCAAGACCATAGCAGGCATCGCCTCAGCTGTGAT 
CCTCTTTGTTGCTGTGGTTGCCACCACCATCTGCTGCTTCCTCTGTTCCTGTTGCTACCTGT 
ACCGCCGGCGCCAGCAGCTCCAGAGCCCATTTGAAGGCCAGGAGATTCCAATGACAGGCATC 
CCAGTGCAGCCAGTATACCCATACCCCCAGGACCCCAAAGCTGGCCCTGCACCCCCACAGCC 
TGGCTTCATGTACCCACCTAGTGGTCCTGCTCCCCAATATCCACTCTACCCAGCTGGGCCCC 
CAGTCTACAACCCTGCAGCTCCTCCTCCCTATATGCCACCACAGCCCTCTTACCCGGGAGCC 
TGAGGAACCAGCCATGTCTCTGCTGCCCCTTCAGTGATGCCAACCTTGGGAGATGCCCTCAT 
CCTGTACCTGCATCTGGTCCTGGGGGTGGCAGGAGTCCTCCAGCCACCAGGCCCCAGACCAA 
GCCAAGCCCTGGGCCCTACTGGGGACAGAGCCCCAGGGAAGTGGAACAGGAGCTGAACTAGA 
ACTATGAGGGGTTGGGGGGAGGGCTTGGAATTATGGGCTATTTTTACTGGGGGCAAGGGAGG 
GAGATGACAGCCTGGGTCACAGTGCCTGTTTTCAAATAGTCCCTCTGCTCCCAAGATCCCAG 
CCAGGAAGGCTGGGGCCCTACTGTTTGTCCCCTCTGGGCTGGGGTGGGGGGAGGGAGGAGGT 
TCCGTCAGCAGCTGGCAGTAGCCCTCCTCTCTGGCTGCCCCACTGGCCACATCTCTGGCCTG 
CTAGATTAAAGCTGTAAAGACAAAA 



FIGURE 246 

MPPAGLRRAAPLTAIALLVIjGAPLV^ 

CRDLTLLITERQQKHCLAFSPKTIAGIASAVILFVAWATTICCFLCSCCYLTORRQQLQSP 
FEGQEIPMTGIPVQPVYPYPQDPKAGPAPPQPGFMYPPSGPAPQYPLYPAGPPVYNPAAPPP 
YMPPQPSYPGA 



